SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: JACOBSON, Myron K . ; JACOBSON, Elaine L.; AME, Jean- 

Christophe; LIN, Winston 

(ii) TITLE OF INVENTION: GENES ENCODING SEVERAL POLY ( ADP-RIBOSE ) 
GLYCOHYDROLASE (PARG) ENZYMES , THE PROTEINS AND FRAGMENTS THEREOF, AND 
ANTIBODIES IMMUNOREACT IVE THEREWITH 

(iii) NUMBER OF SEQUENCES: 38 

(iv) CORRESPONDENCE ADDRESS: 



UJ 



(A) 


ADDRESSEE: 


Fulbright & Jaworski L.L.P. 


(B) 


STREET : 


666 Fifth Avenue 


(C) 


CITY : 


New York 


CD) 


STATE: 


New York 


CE) 


COUNTRY : 


USA 


(F) 


ZIP : 


10103 



(v) COMPUTER READABLE FORM: 
yi (A) MEDIUM TYPE: 3.5 inch 1.44 Mb storage diskette 

(B) COMPUTER: IBM PS/2 

(C) OPERATING SYSTEM: PC-DOS 

(D) SOFTWARE: Wordperfect 



^ (A) APPLICATION NUMBER: To Be Assigned 

-Jar 
; c 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: Concurrently Herewith 



Iviii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: John E. Lynch 

(B) REGISTRATION NUMBER: 20,940 

(C) REFERENCE/ DOCKET NUMBER: NIAD 201-JEL/ES 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212-318-3000 

(B) TELEFAX: 212-752-5958 
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(2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION,: SEQ ID NO: 1: 



1 accggaaagt 
61 tggaagaggg 
121 ctgcagaagc 
181 agtccgtccc 
24 1 gtgctgcggg 
301 gctgggacgc 
361 ggcagaggcg 
421 cgtcaggctg 
481 tcaaacagaa 
541 aaagtttgca 
601 aaaaagataa 
661 ttgataagtc 
721 tgtgtaagtg 
781 cggtaactct 
84 1 atgatcacag 
901 taaagcttgc 
961 accagaagtg 
1021 agacagacgt 
1081 gactgaaaaa 
1141 ttgagaaaga 
1201 aggattcaga 
1261 ctgctcaaac 
1321 ggaagcggtc 
1381 agagtcgagc 
1441 atgtagagtg 
1501 tcatgagagt 
1561 gaacagaaag 
1621 ttggaactcc 
1681 tgagaccatc 
1741 ttcctaaacc 
1801 cttgttcaga 
1861 gccggtggga 
1921 aggatgctat 
1981 ttgatttctg 
2041 tgcctgatat 
2101 tcctgaaaca 
2161 tagctaatgc 
2221 ccagttatcc 
2281 agaagcttaa 
2341 tggtgacatt 
2401 tcctgactcg 
2461 tacaggtgga 
2521 aagaagaaat 
2581 tgctggatca 
2641 gctatgccga 
2701 ggcagaggcg 
2761 accagtttgt 
2821 gtcctggagt 
2881 cctttggggg 
2941 agcgagacgt 
3001 tgcatacatt 
3061 gatattacaa 



gaacgaagcc 
aaggcaggcg 
agtcagcggc 
tcccagggtt 
tcccagcatg 
cgctgcaact 
cgtcctcgat 
cgccctgggc 
gactataacc 
tagtaaagaa 
cttttatcaa 
accagttgaa 
gcagaatgaa 
ggtaccagag 
tgacacaaat 
gaatgcaaag 
tggcaaggct 
ggtgtccgag 
tgccaacaga 
aagt gaacct 
agcagatgaa 
agcaaataaa 
ctctgctaag 
tggaatgaat 
cagaaattct 
gcccaaagca 
gaagatccct 
tattgaggag 
tgccaatcac 
tttcccaaca 
acaaaacttg 
actcattcag 
tctgaagtac 
ggataaggta 
ggtgaaaatt 
gaagatgaat 
tttcttctgc 
agatattaac 
aacgctcttc 
cacaagacag 
actgcatgtc 
ttttgcaaac 
ccgcttttta 
caatgaatgt 
aacataccgc 
cacgactgag 
gcccgagaag 
ttcttcagag 
tgatgctaga 
ggtttatttc 
cctcactgag 
tgaagaatgc 



cgaatcagaa 
tctggatagg 
agagggggca 
agtgaatgag 
agtgcgggcc 
tctccgccgg 
tccaaggacg 
cgggcgggac 
agt tgga tgg 
aacaacaata 
cataacatgg 
aaaggtacac 
gggccacact 
cagttcagta 
agtgaggaga 
cagacgatgg 
tgccatcctg 
agccccttgt 
ttgaat agac 
gagtcaccaa 
gagacaagtc 
ccttcaaggt 
ggaggtgaga 
gatgtgaatg 
aagcaacatg 
gaggacaaaa 
aaatacattc 
atgaggagaa 
acagtgacta 
cattttaaag 
taccctgtgg 
actgcacttc 
aatgtggcat 
ctagaagaag 
gcactctgtc 
cattccatca 
acgtttccac 
ttcaatcggt 
tgctacttta 
agtcttgaag 
acttacgaag 
cgtttcgttg 
atcaaccctg 
cttatcatca 
tgggcccgga 
atcgtcgcca 
atcagacggg 
aacctgtctg 
ctaaaagcct 
acctttgggg 
aggaaactga 
agaaactgct 



cggctcatcc 
gcctggttcg 
tggtgccggg 
gctctacgcc 
ccggctgtga 
ccgcctcgga 
ctccggtgca 
agcaccgagg 
acactaaagg 
caagagaaga 
aaaaattaga 
agtatttgaa 
cagaacggct 
atgctaatgt 
gtagagataa 
aagatgaaca 
cagaagcctg 
cggacactgg 
aagaaagtag 
tggatgtaga 
caggttttga 
tccaaccaag 
ttcgattaca 
ccaaacgacc 
ggagaaagga 
gaaaagaaca 
cacctcacct 
tgccaaggtg 
ttcgggtaga 
atttgtggga 
aagatgagaa 
tcaacaggct 
attctaagaa 
cagaagctca 
tgccaaatat 
caatgtcaca 
gacgcaatgc 
tgtttgaagg 
gaagagtcac 
attttccaga 
gtaccataga 
gaggtggtgt 
agttgattgt 
caggtactga 
gccatgaaga 
tcgacgccct 
agcttaacaa 
cagtggctac 
taatacagat 
actcagaact 
ctgttggaga 
ccacccccgg 



tgaggctggt 
ggaggctgtc 
aggcaccgag 
cgggctggcc 
gccctgcacc 
cgcccggagc 
gttcagggtc 
cagcgccacc 
aatcaagaca 
atccatgatg 
aaatgtttct 
gcagcatcag 
tttggaaagt 
cgatcagtcg 
tcagcagttt 
gggcagagaa 
tgcagggtgt 
ctctgaggat 
tctaggaaat 
taattccaaa 
tgaacaggaa 
agaagctgac 
tttccaattt 
tggaagtact 
ttctaaaatc 
atgtgaaatg 
ttctccagat 
tgggatccgg 
tcttttgcga 
caacaagcat 
tggtgagcga 
cactcggccc 
atgggacttt 
acacttgtat 
ttgtacccag 
ggaacagatt 
caagatgaaa 
acgttcatca 
agagaaaaaa 
gtgggaaaga 
aggaaacggc 
aaccagtgca 
t tcacggctc 
gcagtacagt 
caggagcgaa 
ccact tcaga 
ggcttactgt 
aggaaactgg 
cctggcagct 
gatgagagac 
agtatataag 
accagacatc 



agggtgccgg 
agagcaggag 
gagggggcgc 
cggagactca 
aagcgacccc 
ttccccggca 
ccgccgtcct 
tctcttgttt 
gttgaatcag 
agttctgtac 
cagctaggtt 
actgcggcta 
gaacctccag 
tccccaaagg 
ttgacacatg 
gccagaagcc 
cagcaggagg 
gttggtactg 
tct cct ccat 
aatagttgtc 
gatagcagtt 
actgagttga 
gaaggaggag 
tctagcctga 
acagatcatt 
aaacatcaaa 
aagaaatggc 
ctgcctccct 
ataggagaag 
gttaagatgc 
gctgcaggca 
cagaacctga 
acagctttga 
cagtccatct 
ccaataccac 
gccagtcttt 
tcagagtatt 
aggaaaccag 
cccactgggt 
tgtgaaaaac 
cagggcatgc 
ggacttgtgc 
ttcactgagg 
gaatacacag 
agggacgact 
cgctacctcg 
ggatttcttc 
ggctgtggtg 
gctgtagctg 
atttacagca 
ctgctgctac 
aagctttatc 
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3121 cattcatata ccatgcagtt gagtcctgta cacagaccac caaccagccg ggacaaagga 
318 1 cgggggcctg aggagccaag tgactagacg ctccccactt gtgtaacaag aaggtgtgac 
3241 gtgtgaactg acatgatatc catgtgtata taatccgcgt ttgtaggcaa ggatgcagtc 
3301 ccttccgccc atgcagctgt cagtacatct gcgcctcctc catcccgact tacatagact 
3361 gagacatact ttgtttcttt ttttttctat ttcagccctg attcttttat ttttctttct 
3421 tttgcccatc agacttcttg tgaaatttca tcagagtttg tgctcagcct ggcaggtgtc 
3481 ttttttgatg cctaaatata caaatcacct ctgcagctag cagatgccac ggaaggtggt 
354 1 ggaaccctag gagctgtaac tgagtctgct gcagatctcc ctctgagcct ctcaccccta 
3601 ccctattatc attgtggtgg tggaggtttt ttgatttttg aaataagagt tgggtttgtt 
3661 aaataataca gatctcctag gttaagagtt ttatatttaa gaatactttt caaaaagtta 
3721 ttttgagata tcacctttat ttgtaatggt aatttgcctg tcccttttcc cctgatcaat 
37 8 1 ttgtattgac tgtttttgga aattgaccca aatgaaagga aatatgagaa taagagtttc 
3841 ccaaa-tggtg tttaaaaaca aacaggttca agacacgcga aggacctcgt ttcctgggat 
3901 tttttttctt tttctttttt tgaattagga ttattgtttg ttccttggtg cttgagacat 
3961 attcatataa ccaaagttta ggaactggga acttcgtggt gatttgtaca tattgaagtt 
4 02 1 tctctggtac tcaaaggtta tgtagttaat aaattttcat taacaaaaaa 



(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 977 amino acids 

(B) TYPE: protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



1 MSAGPGCEPC 

61 GRAGQHRGSA 

121 QHNMEKLENV 

181 EQFSNANVDQ 

241 ACHPAEACAG 

301 PESPMDVDNS 

3 61 KGGEIRLHFQ 

421 AEDKRKEQCE 

481 HTVTIRVDLL 

541 QTALLNRLTR 

601 IALCLPNICT 

661 NFNRLFEGRS 

721 VTYEGTIEGN 

781 CLI ITGTEQY 

841 KIRRELNKAY 

901 FT FGDSELMR 

961 VESCTQTTNQ 



TKRPRWDAAA 
TSLVFKQKTI 
SQLGFDKSPV 
SSPKDDHSDT 
CQQEETDVVS 
KNSCQDSEAD 
FEGGESRAGM 
MKHQRTERKI 
RIGEVPKPFP 
PQNLKDAILK 
QPIPLLKQKM 
SRKPEKLKTL 
GQGMLQVDFA 
SEYTGYAETY 
CGFLRPGVSS 
DIYSMHTFLT 
PGQRTGA 



TSPPAASDAR 
TSWMDTKGIK 
EKGTQYLKQH 
NSEESRDNQQ 
ESPLSDTGSE 
EETSPGFDEQ 
NDVNAKRPGS 
PKYIPPHLSP 
THFKDLWDNK 
YNVAYSKKWD 
NHSITMSQEQ 
FCYFRRVTEK 
NRFVGGGVTS 
RWARSHEDRS 
ENLSAVATGN 
ERKLTVGEVY 



SFPGRQRRVL 
TVESESLHSK 
QTAAMCKWQN 
FLTHVKLANA 
DVGTGLKNAN 
EDSSSAQTAN 
TSSLNVECRN 
DKKWLGTPIE 
HVKMPCSEQN 
FTALIDFWDK 
IASLLANAFF 
KPTGLVTFTR 
AGLVQEEIRF 
ERDDWQRRTT 
WGCGAFGGDA 
KLLLRYYNEE 



DSKDAPVQFR 
ENNNTREESM 
EGPHSERLLE 
KQTMEDEQGR 
RLNRQESSLG 
KPSRFQPREA 
SKQHGRKDSK 
EMRRMPRCGI 
LYPVEDENGE 
VLEEAEAQHL 
CTFPRRNAKM 
QSLEDFPEWE 
LINPELIVSR 
EIVAIDALHF 
RLKALIQILA 
CRNCSTPGPD 



VPPSSSGCAL 
MSSVQKDNFY 
SEPPAVTLVP 
EARSHQKCGK 
NSPPFEKESE 
DTELRKRSSA 
ITDHFMRVPK 
RLPPLRPSAN 
RAAGSRWELI 
YQSILPDMVK 
KSEYSSYPDI 
RCEKLLTRLH 
LFTEVLDHNE 
RRYLDQFVPE 
AAVAERDVVY 
IKLYPFIYHA 



(2) INFORMATION FOR SEQ ID NO: 3: 
{i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



1 ggcgtctggg aagtgaggag cgtctctgcc tggcagaggc tgcaatctct gcactttggg 

61 gggccaaggc aggcgctgag aaggacgcgc agtccatctc tctcaggtta gtgaaatgag 

121 gctctccgcg gggccggccc ggggacagtg cgctgctggt cccagcatga atgcgggccc 

181 cggctgtgaa ccctgcacca aagcgacccg ctggggcgcc gctacaactt cgccggctgc 

241 ttcggacgcc cggagctttc cgagcaggca gaggcgcgtc ctcgacccca aggacgctca 

301 cgtgcagttc agggtcccac cgtcctcgcc agcctgcgtc ccagggcagg cgggacagca 
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361 cagaggcagc gccacctcgc ttgttttcaa acaaaagact attaccagtt ggatggacac 
421 taaaggaatc aagacagcgg aatcagaaag tttggatagt aaagaaaaca acaatacaag 
481 aatagaatcc atgatgagtt ctgtacaaaa agataacttt taccaacata atgtagaaaa 
541 attagtaaat gtttctcagc taagtcttga taagtcactc actgaaaaaa gtacacagta 
601 tttgaaccag catcagactg cagcaatgtg taagtggcaa aatgaaggga aacacacgga 
661 gcagcttttg gaaagtgaac ctcaaacagt aaccctggta ccagagcagt ttagtaatgc 
721 taacattgat cggtcacctc aaaatgatga tcacagtgac acagatagtg aagagaatag 
781 agacaatcaa cagtttctca caactgtaaa gcttgcaaat gcaaagcaga ctacggaaga 
841 tgaacacgcc agagaagcca aaagccacca gaagtgcagc aagtcttgcc atcctgggga 
901 agactgtgca agttgtcagc aagatgagat agacgtggtg ccaaagagtc cattgtcaga 
961 tgttggctct gaggatgttg gtactgggtc aaaaaatgac aacaaattga ttagacaaga 
1021 aagttgccta ggaaattctc ctccatttga gaaggaaagt gaacccgaat caccgatgga 
1081 tgtggataat tctaaaaata gttgtcaaga ctcagaagca gatgaggaga caagtccagg 
1141 ttttgatgaa caagaagatg gtagttcctc ccaaacagca aataaacctt caaggttcca 
1201 agcaagagac gctgacattg aatttaggaa acggtactct actaagggcg gtgaagttag 
1261 attacatttc caatttgaag gaggagagag tcgcactgga atgaatgatt taaatgctaa 
1321 actacctgga aatatttcta gcctgaatgt agaatgcaga aattctaagc aacatggaaa 
1381 aaaggattct aaaatcacag atcatttgat gagactgccc aaagcagagg acagaagaaa 
14 4 1 agaacagtgg gaaaccaaac atcaaagaac agaaaggaag atccctaaat acgttccacc 
1501 tcacctttct ccagataaga agtggcttgg aactcccatt gaggagatga gaagaatgcc 
1561 tcggtgtggg atccggctgc ctctcttgag accatctgcc aatcacacag taactattcg 
1 62 1 ggtagatctt ttgcgagcag gagaagttcc taaacctttt ccaacacatt ataaagattt 
1 68 1 gtgggataac aagcatgtta aaatgccttg ttcagaacaa aatttgtacc cagtggaaga 
17 4 1 tgagaatggt gagcgaactg cggggagccg gtgggagctc attcagactg cacttctcaa 
1801 caaatttaca cgaccccaaa acttgaagga tgctattctg aaatacaatg tggcatattc 
18 61 taagaaatgg gactttacag ctttgatcga tttctgggat aaggtacttg aagaagcaga 
1921 agctcaacat ttatatcagt ccatcttgcc tgatatggtg aaaattgcac tctgtctgcc 
1 98 1 aaatatttgc acccagccaa taccactcct gaaacagaag atgaatcatt ccatcacaat 
2 04 1 gtcgcaggaa cagattgcca gtcttttagc taatgctttc ttctgcacat ttccacgacg 
2101 aaatgctaag atgaaatcgg agtattctag ttacccagac attaacttca atcgattgtt 
2161 tgagggacgt tcatcaagga aaccggagaa acttaaaacg ctcttctgct actttagaag 
222 1 agtcacagag aaaaaaccta ctgggttggt gacatttaca agacagagtc ttgaagattt 
228 1 tccagaatgg gaaagatgtg aaaaaccctt gacacgattg catgtcactt acgaaggtac 
2 34 1 catagaagaa aatggccaag gcatgctaca ggtggatttt gcaaatcgtt ttgttggagg 
2401 tggtgtaacc agtgcaggac ttgtgcaaga agaaatccgc tttttaatca atcctgagtt 
2 4 61 gattatttca cggctcttca ctgaggtgct ggatcacaat gaatgtctaa ttatcacagg 
2521 tactgagcag tacagtgaat acacaggcta tgctgagaca tatcgttggt cccggagcca 
2581 cgaagatggg agtgaaaggg acgactgcga gcggcgctgc actgagatcg ttgccatcga 
2 64 1 tgctcttcac ttcagacgct acctcgatca gtttgtgcct gagaaaatga gacgcgagct 
2701 gaacaaggct tactgtggat ttctccgtcc tggagtttct tcagagaatc tttctgcagt 
27 61 ggccacagga aactggggct gtggtgcctt tgggggtgat gccaggttaa aagccttaat 
2821 acagatattg gcagctgctg cagctgagcg agatgtggtt tatttcacct ttggggactc 
288 1 agaattgatg agagacattt acagcatgca cattttcctt actgaaagga aactcactgt 
2 94 1 tggagatgtg tataagctgt tgctacgata ctacaatgaa gaatgcagaa actgttccac 
3001 ccctggacca gacatcaagc tttatccatt catataccat gctgtcgagt cctgtgcaga 
3061 gaccgctgac cattcagggc aaaggacagg gacctgagga gccgagcgaa tagcatctcc 
3121 tcccacctcc caccagagac gtcctgtttg agctgtcagg tgtaatatat gaattgactt 
3181 aagttaatat aaatgtgtac ataatccaca tttgtagtca aggacgcaat ctcttccaca 
32 4 1 catgtgcagt tgtcagttgg tacatctaaa ctccctccat cctgactcac gtggacttag 
3301 atatgttttg tttctatttt cttctatttc agtttttcat tctttgatgt ttatttcttt 
3361 tgtccatcag atctcttgtg aaatcccatg gaaggttgtg ctcagctgtc gggtctcttt 
3421 cttcctgccc atatattata ccagttgctt ctgcagcccg cagatgccca gcgatgccca 
3481 ggaaacaagt tgaaatccca ggaatctctt taactgattt tgctaaaaat ctccctgtga 
3541 gccttccact caactcttaa tatgcttgca ttgtttaagt ttttaaattc tgaaaattaa 
3601 taattagggt ttttttcata tgtgttgcat aatgcaaacc tcctaggtta aaatagtttc 
3661 tttatttaag atagaataat ttccagaaat tgtacttttg aggtatcatt tttatctgta 
3721 atggtttgtc tgtctttttt cctctgatca gtattttttt ataccagttt tggagactgc 
3781 ctgagatgaa aggaaatgtg gaataaaagg aggttttcct gatgtggtgt aaagaaaaca 
38 4 1 gattccaaga gaattgaaga ttttttttgt ttccttggta cttttttctt tttaaattag 
3901 gactaatgtt tcttttgtgg tgcttgaggc atattcatat aaccaaagtt tgagaactgg 
3961 gaacttcatg ctgatttgta catattgaag tttctctggt attcaaaggt tatatagtga 
4021 atgaattttc attaataaat cactttgtca gaaaaaaaaa aaaaaaaaa 
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(2) IN FORMAT ION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 6 amino acids 

(B) TYPE: protein 

■(C) STRANDEDNESS : single 
( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



61 

121 

181 

241 

301 

361 

421 

481 



.3** 661 
7 2 1 

^ 781 

W 841 

4;: 901 

r (2) 

□ 

5 



MNAGPGCEPC 
QAGQHRGSAT 
HNVEKLVNVS 
QFSNANIDRS 
CHPGEDCASC 
ESPMDVDNSK 
GGEVRLHFQF 
EDRRKEQWET 
TVTIRVDLLR 
TALLNKFTRP 
ALCLPNICTQ 
FNRLFEGRSS 
TYEGTIEENG 
LI ITGTEQYS 
MRRELNKAYC 
TFGDSELMRD 
ESCAETADHS 



TKATRWGAAT 
SLVFKQKTIT 
QLSLDKSLTE 
PQNDDHSDTD 
QQDEIDVVPK 
NSCQDSEADE 
EGGESRTGMN 
KHQRTERKIP 
AGEVPKPFPT 
QNLKDAILKY 
PI PLLKQKMN 
RKPEKLKTLF 
QGMLQVDFAN 
EYTGYAETYR 
GFLRPGVSSE 
IYSMHI FLTE 
GQRTGT 



gtgctgcggg 
ggggcgccgc 
agaggcgtgt 
cagcctgcgt 
aacaaaagac 
gtaaagaaaa 
tttacccaca 
ccacagaaaa 
agaatgaagg 
taccaaagca 
acacagatca 
atacaaagcc 
gccagtctgt 
gtcctttgtc 
tgactggaca 
agtcaccaat 
aaacaagtcc 
caagttgcca 
gtgaagttag 
atgccaagcc 
atggaaaaag 
gaagaaaaga 
tcccacctaa 



tcccagcatg 
tggaacttct 
tctcgacccc 
ctcggggcgg 
tattactact 
caacaataca 
taaggtggaa 
gagttcacag 
gaagcatgca 
gcttagtaat 
tgaagaagac 
aacagtagga 
gaaagactgt 
agatgttggt 
agaaagcagc 
ggatgtagac 
agtctttgat 
agcaagagaa 
attgcatttc 
atctggaaac 
ggattctaaa 
acaatgtgaa 
cctccctcca 



agtgcgggcc 
gcgccgactg 
aaggacgctc 
gcgggaccgc 
tggatggata 
agaattgact 
aaattggaaa 
tatttgaacc 
gaacagcttt 
gctaacattg 
agagacaatc 
gatgggcagg 
acaggctgtc 
gccgaggaca 
ctaggtgatt 
aactcgagaa 
gagcaagatg 
gctgatggcg 
caatttgaag 
tcttctagcc 
attacagatc 
gtcagacatc 
gagaagaagt 



PKDAHVQFRV 
NNNTRIESMM 
GKHTEQLLES 
QTTEDEHARE 
LIRQESCLGN 
PSRFQARDAD 
KQHGKKDSKI 
MRRMPRCGIR 
YPVEDENGER 
LEEAEAQHLY 
TFPRRNAKMK 
SLEDFPEWER 
INPELIISRL 
IVAIDALHFR 
LKALIQILAA 
RNCSTPGPDI 



ccggctggga 
cctcggactc 
ccgtccagtt 
acagaggcaa 
ctaaaggacc 
ccatgatgag 
atgttcctca 
aacagcagac 
tggcaagtga 
gtcagtcacc 
agcagtttct 
ccagaagcaa 
aacaggagga 
ttggaactgg 
cgcctccatt 
acagttgtca 
atcgttcctc 
atcttaggaa 
gagaaaataa 
ttaatgtaga 
atttcatgag 
aaagaacaga 
ggctgggaac 



PPSSPACVPG 
SSVQKDNFYQ 
EPQTVTLVPE 
AKSHQKCSKS 
SPPFEKESEP 
IEFRKRYSTK 
TDHLMRLPKA 
LPLLRPSANH 
TAGSRWELIQ 
QSILPDMVKI 
SEYSSYPDIN 
CEKPLTRLHV 
FTEVLDHNEC 
RYLDQFVPEK 
AAAERDVVYF 
KLYPFI YHAV 



gccctgcacg 
ccggagcttc 
cagggtccct 
cgccacctcg 
caagacagct 
ttctgtgcag 
gctaaatctt 
tgcgagtgtg 
gcctcccgcg 
ccacactgat 
tacacctata 
ctgtaagtgc 
ggtggatgtg 
accaaaaaat 
tgagaaagaa 
agattcagaa 
ccaaacagca 
acggtatttg 
tgctgggacc 
gtgtagaagt 
aatttccaag 
aaggaagatt 
tcctattgag 



TSPAASDARS 
SWMDTKGIKT 
KSTQYLNQHQ 
SEENRDNQQF 
SPLSDVGSED 
ETSPGFDEQE 
DLNAKLPGNI 
KYVPPHLSPD 
HYKDLWDNKH 
NVAYSKKWDF 
HSITMSQEQI 
CYFRRVTEKK 
RFVGGGVTSA 
WSRSHEDGSE 
NLSAVATGNW 
RKLTVGDVYK 



FPSRQRRVLD 
AESESLDSKE 
TAAMCKWQNE 
LTTVKLANAK 
VGTGSKNDNK 
DGSSSQTANK 
SSLNVECRNS 
KKWLGTPIEE 
VKMPCSEQNL 
TALIDFWDKV 
ASLLANAFFC 
PTGLVT FTRQ 
GLVQEEIRFL 
RDDCERRCTE 
GCGAFGGDAR 
LLLRYYNEEC 



INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 



1 gggggactgt 
61 aaagcgcgct 
121 cctggcaggc 
181 ccgtcctcgc 
241 tttgttttca 
301 gaatcagaaa 
361 aaagataact 
421 gataaatcac 
481 tgcaagtggc 
541 gggactccgc 
601 gaccacagtg 
661 aaacttgcaa 
721 agtggatctc 
781 ctaccagaga 
841 gacaacaaat 
901 agtgagcctg 
961 gcagatgaag 
1021 aataaacttt 
1081 actaagggaa 
1141 agtgacttaa 
1201 tccaagcagc 
1261 tcagaggaca 
1321 ccaaaataca 
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1381 gaaatgagaa 
1441 cacacagtga 
1501 acacattaca 
1561 ttgtaccctg 
1621 cagactgcac 
1681 tacaatgtgg 
1741 gtacttgaag 
1801 attgcactct 
1861 aatcattctg 
1921 tgcacatttc 
1981 aacttcaatc 
2041 ttctgctact 
2101 cagagtcttg 
2161 gtcacttacg 
2221 aatcgttttg 
2281 ttaatcaatc 
2341 tgtcttatta 
2401 cgttgggccc 
2461 gagatcgttg 
2521 aaagtgagac 
2581 gaaaatcttt 
2641 agattaaaag 
2701 ttcacctttg 
2761 gagaggaagc 
2821 tgcagaaact 
2881 gttgagtcaa 
2941 agtgactagg 
3001 attgactgaa 
3061 cttctgcata 
3121 ttcaattttt 
3181 tgggaaaact 
3241 tgtccagact 
3301 gaaatctgag 
3361 tgtgttgtct 
3421 tatcttttag 
3481 aaacatcatc 
3541 agctggccaa 
3601 gccaggttca 
3661 agggtgagtg 
3721 gagacttgat 
3781 aaatttcatt 



aaatgcctcg 
ctgttcgggt 
aagatttgtg 
tggaagatga 
ttctcaacaa 
catattctaa 
aagcagaggc 
gtctgccaaa 
tcacgatgtc 
cccgacggaa 
ggttgtttga 
ttcgaagagt 
aagattttcc 
agggtaccat 
ttggaggtgg 
ctgaattgat 
tcacaggtac 
gaagccatga 
ccattgacgc 
gtgagcttaa 
ctgcagtggc 
ccttaataca 
gggactcaga 
tggatgttgg 
gttccacccc 
gtgcagagac 
acctcctctc 
tctgatctaa 
tgcagttgtt 
cttctccttc 
caagaaaggt 
gctttggtgg 
tttgctgcag 
acattttcaa 
gctaaaacta 
cctataagta 
gaggaaagaa 
agagcgt tgg 
cttgttctgt 
gctgatttgt 
aataaatcat 



gtgtgggatc 
agaccttctg 
ggataacaaa 
gaatggtgag 
attcacacga 
gaaatgggac 
ccaacattta 
tatttgcacc 
acaggaacag 
tgccaagatg 
aggacgttca 
cacagagaaa 
agaatgggaa 
agaaggcaac 
tgtgactggt 
tgt t tcacgg 
tgaacagtac 
agatgggagt 
acttcacttc 
caaggcttac 
cacgggaaac 
gatcctggca 
gttgatgaga 
aaaagtgtac 
tggaccagac 
cactgacatg 
aaagagacat 
gtgtgtatat 
tcttgttcat 
agtctttatt 
tgcactcagc 
ctagcagata 
atttacctgt 
ttttggaagt 
ttttctattt 
acggtttttt 
aatgtagaat 
ggtttttgtt 
ggtgctgagg 
acatatggaa 
ttgtcagaaa 



catttgcctt 
agagcaggag 
catgtgaaaa 
cgaactgcag 
ccccagaact 
tttacagctt 
tatcagtcca 
cagccaatac 
atcgccagtc 
aaatcggagt 
tcaaggaaac 
aaacctacag 
aggtgtgaaa 
ggccgaggca 
gcgggacttg 
ctgttcactg 
agtgaataca 
gaaaaggacg 
agacgctacc 
tgcggattcc 
tggggctgtg 
gctgctgcgg 
gacatttaca 
aagttattgc 
atcaagcttt 
ccaggacaga 
cctatttgaa 
aatccacatt 
cctggtggac 
ctttgatttt 
ttctagatct 
ccatcacact 
gagcttctca 
tgaagtttt t 
aaggcagact 
tcgtcctttt 
aaaaggattt 
tttttcaaga 
gcatagtcct 
gtttctctgg 
aaaa 



ccttaagacc 
aggttccgaa 
tgccttgttc 
ggagtaggtg 
tgaaggatgc 
tggttgattt 
ttttacctga 
cactcctgaa 
ttttagctaa 
attctagtta 
cagaaaaact 
gattggtgac 
agcctctgac 
tgctacaggt 
tacaagaaga 
aggtgctgga 
caggctatgc 
attggcagcg 
tcgatcagtt 
tccgtcctgg 
gtgcctttgg 
ctgaacgtga 
gcatgcacac 
ttagatacta 
atccattcat 
aggcaggcac 
atgtggggtg 
tgtaatcaag 
atgcctttag 
ttttttccaa 
ttctcttcct 
tggaggaagt 
ctcccaaccc 
cttatgttac 
aatttccagt 
ttccccagcg 
tcctcggatg 
cttgtttttc 
gtaaccaaag 
caggaaatat 



atctgcaagt 
accttttcca 
ggaacaaaac 
ggagctcatt 
gattctgaaa 
ctgggataag 
catggtgaaa 
acagaagatg 
tgctttcttc 
cccagacatt 
gaaaacactc 
atttacaaga 
acgcttacac 
ggattttgca 
aatcagattt 
tcacaatgag 
tgaaacttat 
gcgctgcacg 
tgtgcctgag 
agttccttct 
gggtgacgct 
cgtggtttat 
tttccttacc 
caatgaagaa 
ataccatgct 
ctgaggaaca 
tgatgtctga 
gatgcagtct 
acatggcttc 
cttgatttct 
gtctgtgtgt 
tacaaatcca 
ttgttaggct 
ttaatgctag 
ttctcttttg 
ctattttaga 
ctataaagaa 
ctttgcagct 
gtctttgctg 
tagagttaat 



(2) I N FORMAT I ON FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 968 amino acids 

(B) TYPE: protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID 

1 MSAGPGWEPC TKARWGAAGT SAPTASDSRS 

61 RAGPHRGNAT SFVFKQKTIT TWMDTKGPKT 

121 EKLENVPQLN LDKSPTEKSS QYLNQQQTAS 

181 NANIGQSPHT DDHSDTDHEE DRDNQQFLTP 

241 CTGCQQEEVD VLPESPLSDV GAEDIGTGPK 

301 DNSRNSCQDS EADEETSPVF DEQDDRSSQT 

361 FQFEGENNAG TSDLNAKPSG NSSSLNVECR 

421 EVRHQRTERK IPKYIPPNLP PEKKWLGTPI 

4 81 LRAGEVPKPF PTHYKDLWDN KHVKMPCSEQ 



NO: 6 : 

FPGRQRRVLD PKDAPVQFRV PPSSPACVSG 
AESESKENNN TRIDSMMSSV QKDNFYPHKV 
VCKWQNEGKH AEQLLASEPP AGTPLPKQLS 
IKLANTKPTV GDGQARSNCK CSGSRQSVKD 
NDNKLTGQES SLGDSPPFEK ESEPESPMDV 
ANKLSSCQAR EADGDLRKRY LTKGSEVRLH 
SSKQHGKRDS KITDHFMRIS KSEDRRKEQC 
EEMRKMPRCG IHLPSLRPSA SHTVTVRVDL 
NLYPVEDENG ERTAGSRWEL IQTALLNKFT 



569823.5 



75 



541 RPQNLKDAIL KYNVAYSKKW DFTALVDFWD KVLEEAEAQH LYQS ILPDMV KIALCLPNIC 

601 TQPIPLLKQK MNHSVTMSQE QIASLLANAF FCTFPRRNAK MKSEYSSYPD INFNRLFEGR 

661 SSRKPEKLKT LFCYFRRVTE KKPTGLVTFT RQSLEDFPEW ERCEKPLTRL HVTYEGTIEG 

721 NGRGMLQVDF ANRFVGGGVT GAGLVQEEIR FLINPELIVS RLFTEVLDHN ECLIITGTEQ 

781 YSEYTGYAET YRWARSHEDG SEKDDWQRRC TEIVAIDALH FRRYLDQFVP EKVRRELNKA 

841 YCGFLRPGVP SENLSAVATG NWGCGAFGGD ARLKALIQIL AAAAAERDVV YFT FGDSELM 

901 RDIYSMHTFL TERKLDVGKV YKLLLRYYNE ECRNCSTPGP DIKLYPFIYH AVESSAETTD 

9 61 MPGQKAGT 



(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2781 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



1 tcgaagtgtg 
61 caccggtatg 
121 atctacggca 
181 tttgtgctta 
241 aacggaggag 
301 catggaggct 
361 tgccggcaat 
421 ctacaaatca 
481 gaaatatccg 
541 cgaacgagcc 
601 atataatacc 
661 ggaactggac 
721 attggcattg 

7 81 gaacgcctca 

8 4 1 gtgcacgttt 
901 catcaacttt 
961 cattatgcac 

1021 cggtgtggta 
1081 aagtgcggcg 
1141 tgagggcatt 
1201 gggacatggc 
1261 taaactcttt 
1321 gtatagtaac 
1381 aacgccaaga 
1441 ttttgcccag 
1501 gtacattgga 
1561 gggttgcggc 
1621 cgcccagttg 
1681 ttttcatgaa 
1741 tattttaagg 
1801 taaggcatcc 
1861 tgtgcccgga 
1921 aggaaaatca 
1981 gcaaatcacc 
2041 ctctaattct 
2101 catgatggag 
2161 aacgtccaaa 
2221 tctgctcgag 
2281 atcacccaac 
2341 cgacaaggac 
2401 tgcccccaaa 
2461 ggcattttaa 



tggtatttat 
caagaattca 
aatcgccgca 
aactgcgcca 
gagccggaaa 
atacatcgta 
ctccaccggg 
ccgggaaagt 
agggagaatc 
cttctgcagc 
acctataggg 
gagagcgaaa 
cggctaccgg 
ttgagcctga 
ccccgaagaa 
aacaggcttt 
tattttcgtc 
acctttgttc 
ccgttgggtg 
ggactgctgc 
tgcgttcagg 
acggagtgtc 
tatacgggat 
gatagctcag 
tcacatcatc 
tttgttcact 
gcattcggcg 
ggcagacctt 
atgtggctgt 
tcgtacagta 
aaaaagaagc 
gagggagcat 
gaaacatcag 
ataacgcaac 
tcggaagatc 
gccgctagtc 
acgagcagta 
atgctggaca 
tgcagcaagg 
gaaaaggacg 
aattgaagag 
agtaggatca 



aaagtgcgat 
ggtcacactt 
gagcaagtgc 
ggatgtcgaa 
atctggcgaa 
atcggcagcc 
ttatgtacca 
gggactccga 
cggatggcag 
ccataaagac 
atcagtggca 
cacgggtttt 
acttgattca 
gccaacagca 
acaccctcaa 
accaatcgac 
gcgtgtgtcc 
gtcggagcgg 
atgtgccatt 
aagtagactt 
aggagatacg 
tgcgaccatt 
atgccggaag 
gtcgtcgaca 
aatatcgcga 
ggatggtgac 
gtgactccta 
tggcctacta 
tgtttcgaaa 
ggcttattaa 
tatatgattt 
ccgccgaagc 
cgaaatcctc 
aaagtaccga 
aggcccttct 
tggaggctaa 
cagccacgaa 
cccattatga 
ctgagggttc 
atattgttga 
gtataaaatg 
ttgtaagtcg 



attcatcaca 
gatttttccg 
atccgtgctg 
gtcgccggat 
ctccctagat 
tttcgaattg 
gctgccaatt 
gcatgtgcgt 
caccaccatc 
gtgtgaggaa 
ctttcgtgcc 
cttcgaggat 
atcgccagtt 
ga t ct cctgc 
gaggaagtcc 
gggaccggca 
cacagagcgg 
attgccggaa 
gcacgtggat 
t gccaacaaa 
ctttgttatc 
cgaggccctg 
cttcgagtgg 
aacggccatt 
ggatctcatg 
gccgccaccg 
t ctgaaagcc 
tacctttgga 
tgacgggact 
ggagaagagc 
tattaaagag 
tggaagctct 
gccagaactc 
tctattgccc 
tatgctgtcg 
aagcagcgta 
atcaatgggt 
aaagggttcg 
agcaaagagt 
ctaggtgata 
tattgtagat 
aataaagtga 



gctatcgctc 
atattccaaa 
accaatcgac 
ggcgggattt 
gattcgtggc 
gagaatttgc 
cgtgaaacac 
ctgccctgtg 
gat tttcgct 
ctgcaggcgg 
cttcatcaac 
eta t tgeege 
ccgctgctca 
ctgttggcca 
gagtacagca 
gttctggaga 
gatgecagea 
catctgatcg 
gecgagggaa 
tatttgggtg 
tgtceggage 
gtgatgttgg 
tccggcaact 
gtggcaatcg 
gaaagggagc 
ggtgtggcaa 
ctgctgcaac 
aatgtggagt 
aeggtgeage 
tccaaggagc 
gaacttaaga 
agagtagctg 
aacaagcaac 
gegcaattat 
gacgatgagg 
gaaataagca 
tcaggtggcc 
gectcgaaga 
cgtaaggaga 
t tgcactaca 
aactttaagg 
aatttttttt 



atccccaaaa 
aggtttacca 
teggcaagge 
ccgaaataga 
gtggagtttc 
caccagtgac 
cgccacgccc 
cgcccgagtc 
gggaaatgat 
cgataatatc 
ttctcgacga 
gcattatccg 
agcaccacaa 
atgecttett 
cttttccaga 
agcttaaatg 
atgtgcccac 
actggageca 
caatcgagga 
gcggtgtctt 
tattggtggg 
gcgccgaaag 
ttgaggattc 
atgccctaca 
tgaacaaggc 
ctggtaactg 
ttatggtctg 
ttagggatga 
agctttggag 
cgcgtgagaa 
aggtcagaga 
gattaggega 
ccgcccgacc 
cgcaagataa 
aggecaatge 
acagcagcac 
gecagttgag 
ggccacgaaa 
tcgatgtgac 
ggattgtt ac 
acatatttag 
tttttttaat 
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2521 tatactattc 
2581 ttaatataat 
2641 gccagtaaag 
2701 agaacatcaa 
2761 atatatatat 



taatctgcaa 
ttgttatatc 
tttcttttta 
caacaacaac 
ctaagcagat 



agacaatttt 
atgcaaatct 
tttaaccgaa 
cacgaaaaaa 
c 



actgttaaat 
agcttttatt 
accttttgtt 
aagcgaa tat 



ttgtataaca 
atgcgaaatt 
tattttattt 
atatttgttt 



ttcgaattaa 
tgtagttaaa 
gaccacaaca 
gttcgtatat 



(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 768 amino acids 

(B) TYPE: protein 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



1 MQEFRSHLIF 

61 EEPENLANSL 

121 SPGKWDSEHV 

181 TTYRDQWHFR 

241 SLSLSQQQIS 

301 HYFRRVCPTE 

361 IGLLQVDFAN 

421 NYTGYAGSFE 

4 81 GFVHWMVTPP 

541 EMWLLFRNDG 

601 GEGASAEAGS 

661 SSEDQALLML 

721 EMLDTHYEKG 



PIFQKVYQST 
DDSWRGVSME 
RLPCAPESKY 
ALHQLLDEEL 
CLLANAFLCT 
RDASNVPTGV 
KYLGGGVLGH 
WSGNFEDSTP 
PGVATGNWGC 
TTVQQLWSIL 
SRVAGLGEGK 
SDDEEANAMM 
SASKRPRKSP 



ANRRRASASV 
AIHRN RQPFE 
PRENPDGSTT 
■ DESETRVFFE 
FPRRNTLKRK 
VTFVRRSGLP 
GCVQEEIRFV 
RDSSGRRQTA 
GAFGGDSYLK 
RSYSRLIKEK 
SETSAKSSPE 
EAAS LEAKS S 
NCSKAEGSAK 



LTNRLGKALC 
LENLPPVTAG 
I DFRWEMIER 
DLLPRI IRLA 
SEYSTFPDIN 
EHLIDWSQSA 
ICPELLVGKL 
I VAIDALHFA 
ALLQLMVCAQ 
SSKEPRENKA 
LNKQPARPQI 
VEISNSSTTS 
SRKEIDVTDK 



LNCARMSKSP 
NLHRVMYQLP 
ALLQPIKTCE 
LRLPDLIQSP 
FNRLYQSTGP 
APLGDVPLHV 
FTECLRPFEA 
QSHHQYREDL 
LGRPLAYYTF 
SKKKLYDFIK 
TITQQSTDLL 
KTSSTATKSM 
DEKDDIVD 



DGGISEIETE 
IRETPPRPYK 
ELQAAI ISYN 
VPLLKHHKNA 
AVLEKLKCIM 
DAEGTIEDEG 
LVMLGAERYS 
MERELNKAYI 
GNVEFRDDFH 
EELKKVRDVP 
PAQLSQDNSN 
GSGGRQLSLL 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



1 AT GAG C AAG A AGTTTATCGA ACTGGGTGAT CCTGTCACTC AAGACGAGAA 

51 AGACTACGAA GACTATGTCG GAGTTGGTTT CGCGCATCAA GTCCCGACAA 

101 TGAAAAGGCG GAAGTTGACA GAACATGGAA ATACTACAGA ATCAAAAGAA 

151 GATCCTGAAG AGCCAAAAAG CCGTGACGTA TTTGTCTCCT CGCAGTCAAG 

201 TGATGAGAGT CAAGAAGATT CGGCTGAAAA TCCGGAGATC GCTAAAGAAG 

2 51 TGTCAGAAAA TTGTGAAAAT CTGACAGAAA CTCTCAAAAT TTCTAATATT 

301 GAGAGTTTGG ACAATGTTAC TGAAAGATCT GAACACACTC TTGATAATCA 

351 CAAAAGTACT GAACCAATGG AAG AAG AT G T AAACAACAAG TCCAATATTG 

4 01 ACGTTGCGAT TAATTCTGAC GAG GAT GAT G AACTTGTTCT G G AAG AG AAT 

4 51 AATAAAGAAA TGAGGGATGG AGAACAAGTA CAACAGTTGT CACAGGATTT 

501 ATTCGCTGAT GATCAAGAGC TAATTGAATA T C C AG G AAT T ATGAAAGACA 

551 CTACAACTCA AC T G GAT AT A ACAGATTCTG AAGTGGAGAC TGCTCAAAAA 

601 ATGGAAATGA TTGAAGAAAC T G AAG C AG AT TCGACATTTG TAGGCGAGGA 

651 TTCAAAAGCT ACGAAAACTG TGAGGACATC CAGTTCAAGT TTCCTGTCAA 

7 01 CTGTTTCAAC ATGCGAAGCC CCTGCAAAAG G AC GAG C AAG AAT GT AT C AA 

7 51 AAAGAGTTGG AAAAGCATGT GATTGCATTT AC T GAG G G AA ATCTCACACT 
801 ACAACCAGAT TTGAACAAAG TTGATCCCGA CAGAAACTAT CGATATTGTA 

8 51 CAATTCCGAA CTTTCCAGCT TCCCAAGGAA AACTTCGAGA AG AT AAT CG A 
901 TATGGCCCAA AAATCGTTTT GCCTCAAAGA TGGCGAGAAT TTGATTCGAG 

9 51 GGGCCGTAGA AGAGACTCAT ATTTCTATTT CAAACGTAAG CTCGATGGAT 
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1001 ATTTGAAATG CTACAAAACA ACTGGATATT TTATGTTTGT TGGACTTTTG 

1051 CACAACATGT GGGAATTTGA CCCAGACATC ACATATAAAC TGCCAGCACT 

1101 GGAAATGTAT TACAAAGAGA TGTCGGAACT TGTTGGTAGA GAAGAGGTTT 

1151 TGGAAAAATT TGCACGAGTT GCCCGCATCG CAAAAACTGC TGAAGATATT 

1201 CTGCCAGAGC GAATTTATCG TCTTGTTGGT GACGTCGAAT CAGCTACCTT 

1251 GAGCCACAAG CAATGTGCTG CACTTGTTGC GAGAATGTTT TTTGCCCGAC 

1301 CGGACAGTCC TTTCAGTTTC TGCCGAATTC TCTCGTCTGA TAAATCTATT 

1351 TGTGTGGAGA AACTTAAATT CCTGTTCACT TATTTCGACA AAATGTCAAT 

14 01 GGATCCACCG GATGGTGCCG TCAGTTTTAG ACTTACAAAA ATGGATAAAG 

14 51 ATACGTTCAA CGAAGAGTGG AAAGATAAAA AATTACGTTC TCTTCCTGAA 

1501 GTTGAATTCT TTGATGAAAT GCTTATTGAA GACACAGCTC TCTGTACACA 

1551 AGTTGATTTT GCGAACGAAC ATCTTGGTGG CGGAGTTTTA AATCATGGGT 

1601 CTGTTCAGGA GGAGATCCGT TTCTTGATGT GTCCAGAAAT GATGGTTGGA 

1651 ATGTTGTTGT GCGAGAAAAT GAAACAACTG GAAGCGATTT CAATTGTTGG 

17 01 AGCTTACGTT TTCAGTTCTT ATACTGGTTA TGGTCATACT CTAAAATGGG 

17 51 CAGAACTTCA ACCAAATCAT TCTCGTCAGA ATACAAACGA ATTTCGAGAT 

1801 CGTTTTGGAC GTCTTCGGGT AGAAACTATT GCAATCGATG CAATTCTGTT 

1851 CAAAGGATCA AAATTAGATT GTCAGACGGA GCAGTTAAAC AAAGCAAATA 

1901 TCATTAGGGA AATGAAGAAA GCATCTATCG GATTCATGAG CCAGGGACCG 

1951 AAATTCACAA ATATTCCAAT TGTTACTGGA TGGTGGGGAT GTGGAGCATT 

2001 TAATGGGGAC AAGCCACTGA AGTTCATAAT CCAAGTAATT GCTGCCGGAG 

2 051 TCGCTGATCG TCCACTTCAT TTCTGTTCAT TTGGAGAACC CGAGCTTGCC 

2101 GCAAAGTGCA AG AAAAT TAT AGAACGAATG AAACAGAAGG ACGTAACACT 

2151 TGGTAAGTCA TGTTTTTCAA TCTTCAGTTG A 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 amino acids 

(B) TYPE: protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



1 MSKKFIELGD 

61 FVSSQSSDES 

121 EPMEEDVNNK 

181 MKDTTTQLDI 

241 PAKGRARMYQ 

301 YGPKIVLPQR 

361 TYKLPALEMY 

4 21 QCAALVARMF 

4 81 MDKDTFNEEW 

541 FLMCPEMMVG 

601 RFGRLRVETI 

661 WWGCGAFNGD 

721 CFSIFS 



PVTQDEKDYE 
QEDSAENPEI 
SNIDVAINSD 
TDSEVETAQK 
KELEKHVIAF 
WREFDSRGRR 
YKEMSELVGR 
FARPDSPFSF 
KDKKLRSLPE 
MLLCEKMKQL 
AIDAILFKGS 
KPLKFIIQVI 



DYVGVGFAHQ 
AKEVSENCEN 
EDDELVLEEN 
MEMIEETEAD 
TEGNLTLQPD 
RDSYFYFKRK 
EEVLEKFARV 
CRILSSDKSI 
VEFFDEMLIE 
EAISIVGAYV 
KLDCQTEQLN 
AAGVADRPLH 



VPTMKRRKLT 
LTETLKISNI 
NKEMRDGEQV 
STFVGEDSKA 
LNKVDPDRNY 
LDGYLKCYKT 
ARIAKTAEDI 
CVEKLKFLFT 
DTALCTQVDF 
FSSYTGYGHT 
KANIIREMKK 
FCSFGEPELA 



EHGNTTESKE 
ESLDNVTERS 
QQLSQDLFAD 
TKTVRTSSSS 
RYCTI PNFPA 
TGYFMFVGLL 
LPERI YRLVG 
YFDKMSMDPP 
ANEHLGGGVL 
LKWAELQPNH 
ASIGFMSQGP 
AKCKKI TERM 



DPEEPKSRDV 
EHTLDNHKST 
DQELIEYPGI 
FLSTVSTCEA 
SQGKLREDNR 
HNMWEFDPDI 
DVESATLSHK 
DGAVSFRLTK 
NHGSVQEEIR 
SRQNTNEFRD 
KFTNIPIVTG 
KQKDVTLGKS 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
LFTEVLDHNE CLI ITGTEQY SEYTGYAETY R 
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(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein fragment 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
AYCGFLRPGV SSENLSAVAT GNXGCGAFG 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
FLINPELIVS R 



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
IALXLPNIXT QPIPLL 



(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein fragment 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GAYCAYAAYG ARTGYYT 



(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein fragment 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CKRTANGTYT CNGCRTA 
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(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
AT CATC AC AG GTACTGAGCA GTAC 



(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GCCTGTGTAT TCACTGTACT GCTC 



(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
EDKRKEQCEM KHQRTERKIP KYIPPH 



(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
EDRRKEQWET KHQRTERKIP KYVPPH 



(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 

EDRRKEQCEV RHQRTERKIP KYIPPN 
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(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
HQVPTMKRRK LTEHGNTTES LLLKEDPPEP KS 



(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

gT| (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 3 

£> 

HJ EGKRKGDEVD GVDEVAKKKS KKEKDK 



(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 
g (B) TYPE: polypeptide 

La ' (C) STRANDEDNESS: single 

Lr t (D) TOPOLOGY: linear 

y (ii) MOLECULE TYPE: polypeptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 



y? EGKRKGDEVD GTDEVAKKKS RKETDK 



(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

EGKRKGDEVD GIDEVTKKKS KKEKDK 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
EGKRKGEEVD GNVVAKKKSR KEKEK 
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(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
EGKRKADEVD GHSAATKKKI KKEKEK 



f 5 ^ 



(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
EELPDTKRAK MELSDTNEEG EKKQR 



4* (2) INFORMATION FOR SEQ ID NO: 29: 

U1 (i) SEQUENCE CHARACTERISTICS: 

1=4 (A) LENGTH: 31 amino acids 

~ (B) TYPE : polypeptide 

L (C) STRANDEDNESS: single 

{ D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: polypeptide 

%0. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 9 

□ 

il EGVSSAKKAK IEKIDEEDAA SIKELTEKIK K 



[2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

GCTGCGGGTC TCGACGATGA GTGCGGGC 

(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GCGTCTAGAA TTCACTTGGC TCCTCAGGC 
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(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 38 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CCGGAATTCG GGTTTTTTGT TAATGAAAAT TTATTAAC 



(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
TCAGAGCAGA TGAACTCGAG CAGTCCAGG 



(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

1 CCAATTTGAA GGAGGAATTC CCGCCGCCAC CATGAATGAT GTGAATGCCA AACGACCTGG 
61 A 



(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 35: 
gaattcccgc cgccaccATG AA 



(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 674 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
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1 agaagaaaat ggccaaggca tgctacaggt ggattttgca aatcgttttg ttggaggtgg 

61 tgtaaccagt gcaggacttg tgcaagaaga aatccgcttt ttaatcaatc ctgagttgat 

121 tatttcacgg ctcttcactg aggtgctgga tcacaatgaa tgtctaatta tcacaggtac 

181 tgagcagtac agtgaataca caggctatgc tgagacatat cgttggtccc ggagccacga 

241 agatgggagt gaaagggacg actgcgagcg gcgctgcact gagatcgttg ccatcgatgc 

301 tcttcacttc agacgctacc tcgatcagtt tgtgcctgag aaaatgagac gcgagctgaa 

361 caaggcttac tgtggatttc tccgtcctgg agtttcttca gagaatcttt ctgcagtggc 

421 cacaggaaac tggggctgtg gtgcctttgg gggtgatgcc aggttaaaag ccttaataca 

481 gatattggca gctgctgcag ctgagcgaga tgtggtttat ttcacctttg gggactcaga 

541 attgatgaga gacatttaca gcatgcacat tttccttact gaaaggaaac tcactgttgg 

601 agatgtgtat aagctgttgc tacgatacta caatgaagaa tgcagaaact gttccacccc 

661 tggaccagac atca 



(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID 

1 aaaaatagtt gtcaagactc agaagcagat 

61 gaagatggta gttcctccca aacagcaaat 

121 gacattgaat ttaggaaacg gtactctact 

181 tttgaaggag gagagagtcg 



NO: 37: 

gaggagacaa gtccaggttt tgatgaacaa 

aaaccttcaa ggttccaagc aagagacgct 

aagggcggtg aagttagatt acatttccaa 



(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29793 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

1 gatctcgaag taaaaactca cgcagaaaga gctcctcctc ctttagcatg agaatccaac 

61 tttgtaatga taacactggc aacatcaaca gtttgagaga aagcacgtgc ttgggcttca 

121 caagcttgtc caatagaagc atccatcaca aaaacaacat tatctggtgt aactgcgttg 

181 gaaacttgga gcatttcttc gaaaagtgaa gcttcttgct tgtgacgacc tgatgtatca 

241 acaatgatga tttcgaaccc ttcttgctgc aaaacaaata ttattaaacc atttttctgt 

301 gataaattac cgtgaatttt tctactcctt cggcggcaat ttttacgggg tcaatttcag 

361 agtatgatcc atagaaggga atacgagctt ttgtggcatt ttgctttaat • tgatcaaaag 

421 ctccagcacg gaatgtatcg gcacagatca gacatgtttt ccatcctttt ctttggtagt 

481 aatacgccat ctgaacttga aaagtgttga aaagttgttg gaagtttact aattaaaaaa 

541 tataatgttt gatggtgtgt gagctttcta ttgtaattca tggaacgaac cttggtacaa 

601 gtcgtagttt taccggaacc ttgaagacca acaaacatga aaacgttgcg acgtcctttt 

661 gttggtgtga aaggagttac accaggatcc acaagcttca gcagttcatt gaatactgtc 

721 ttctgaatgt accgacgttt gtttgctcct ccgacgatct cttcgaaatt aatcgctttt 

781 ctgaaaatat ttattaaatt taaatcttaa atagcgtaaa aatttacttc acgttgtcct 

841 taagttgctt tacaagacga atatgaacat cagattcaat aagagctgta cagacttctt 

901 tcagcatcaa atccagctcc ccctcattga taacggtgct ctgaccgagc tttccgatcg 

961 catttcggat tttccgcccc aaatcggcca aaaccatttt gaactgaaat ttgaaatgct 

1021 ttaatttgtt taagcataga attaaacgcg ttttaaatcg agagcaccat aaaaacagtt 

1081 tggagaaaaa tcgataattc ttgtaggaga ttcagtccct gtggttttct tcggcttcct 

1141 aatcattttt tgacgacata gtggtatttc acaataggtt ttttcaagac acaacagatt 

1201 tttcacaaag agtagagaag aaatggaaaa ctgtagattt cttctcgaag agccgagaaa 

1261 ggcaaggtat tggaagttta aaaaggtaat gtttctttat tcttttttca aaacaataat 
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1321 aaatggaaaa tatatattta tagataacaa tttcagacag ttaaaatcac gtgaaaaatt 

1381 caaatttcaa cacaaaaatt gacgagtgga accccgttgt tgcgccttga agagtaacgc 

1441 ttgcgcgttt gacgatttta ttgacgcgtt tctggtgcat gcgggaaatt ttttattttc 

1501 aacttttttc ctgtttgttt atcctttttt aattgaattc tcatgatttg aaagctttga 

1561 aaaatattat tttgctcaaa aacatgcgtt ttgtaaaaca ttgattagat tcaaggcaat 

1621 taatggattt ttgcacgttc caaaaaaaag gaaattcatt ttttgaaaat tttgataatt 

1681 taataatgaa aaatgttcca tagatttatt caatgccatc cttctctata atctcgaact 

1741 tccgcatcct tcaactgtgg tagaggtatt tgcaatacca tatagtcgta ataataaact 

1801 ttagtgaaca aatccaagac atcagctctt gagtaaatga atgatttata aaaactgctg 

1861 attttctcgt aggaagaaag agaatcagct aataatccgt cgttgtctat tctgtcaggc 

1921 cgcttaaatg ttaaaaaata aaaacgtttt aagctaattt tgtatgtcta gaaactctaa 

1981 ctcacaagca tttctgcata cgccggatta gttggttttg caaaaagcga gtaatctaca 

2041 aaagtgaatt tttgattcat ctcttccatt tcacaaaacc aattttgtgg tacgtatttc 

2101 atatgatctt catccacttt tttagttttt gaatgtattt gtgtgagttg tgtccagatt 

2161 tgaataagat aacatctcag atccaacttg caattgaagc aagaacgatc ttctctgaaa 

2221 ttttatatga ccttaaactt tatacttttg tagtttcgtc gatatctgat cgttcagttg 

2281 tataggtatg tacatctcta ggtttatgtg ctacacgaaa atataatttg ttttacctaa 

2341 cacacgcatc cataaaatga tctacaaatc gttcaattgg atcctgtctt ggaaataata 

2401 atttccaatt cgtaaagttt gcattcaact cattttctcg tttcaaatcg tcgatatccg 

2461 caaaatatgt tagtgaatca ctatcacaca ctctgaaaag cacaatattc atatttcgta 

2521 gttaataatg aacctcacga ttcatcatta aatttctctt ggagcccgca taatacttgc 

2581 tgcccaatta aagtatcagt ttcacagatt gcagttctat catttccgat agcctcaaat 

2641 aagatttaat cttaagcgag tgttctgatc aatttaaata tttgatactc accgcaagtt 

2701 tcttcgaaac ttgttcgaaa gctggaattt tagaatatcc ttcaaaactt ttttcctcgc 

2761 cctcatcaag ccataataag ttttgatcag caatatattc gaataaatta gtctctgata 

2821 aatctcgtat cacaatcttt ttttctactc taaagaatac aattttgata agaatgataa 

2881 taattataat tataatagtt cgtcgctgag ttgatgaaga ccacataatt agtttaatgg 

294 1 caagctatgc aacttgttga atactaatag gacttagcaa atcttatctt gaaccttttt 

3001 cattcgaaag aaaaatgaga tcgaatctcg ttcaaactgt ggagtagtca gttaagaaac 

3061 ttgtttctag tttgtgagga gacactggag aacgtgaaag tattacccat acgcaatatt 

3121 tttgcggcga aaaatacggt acccggtctc gacacgacag tttttaaaac ttgtaaatag 

3181 gtatgtaaaa gaaaacttta attttaaacg tgttgtttcg gaattttcat cgttttgtca 

324 1 tagttattct acaaataatt atttatgaaa aaaaaactaa aatataacta taataacacc 

3301 tgaatattaa caaatcgatc gaaaaaaaac tatgaaaaaa atggatgaaa attccgcagc 

3361 aacgagagtt tgaaatttca gtattcttta aaggcttacc gatttcaata aatagtgaca 

3421 ctgaaaattg tagttttaaa actagttggt tagtatcatc aaatattcaa tccttcaaaa 

3481 attcctcaat attaacgtat tttctctaat tgtcttcatt atctaaaaaa aagttgcaat 

3541 atatttttcc aggcagaaat agactttcac aaaacacatc gacacttcga atgagcaaga 

3601 agtttatcga actgggtgat cctgtcactc aagacgttag ttatagtttt tattacttga 

3661 acattatcat ctttttacag gagaaagact acgaagacta tgtcggagtt ggtttcgcgc 

3721 atcaagtccc gacaatgaaa aggcggaagt tgacagaaca tggaaatact acagaatcaa 

3781 aagaagatcc tgaagagcca aaaagccgtg acgtatttgt ctcctcgcag tcaagtgatg 

3841 agagtcaaga agattcggct gaaaatccgg agatcgctaa agaagtgtca gaaaattgtg 

3901 aaaatctgac agaaactctc aaaatttcta atattgagag tttggacaat gttactgaaa 

3961 gatctgaaca cactcttgat aatcacaaaa gtactgaacc aatggaagaa gatgtaaaca 

4021 acaagtccaa tattgacgtt gcgattaatt ctgacgagga tgatgaactt gttctggaag 

4081 agaataataa agaaatgagg gatggagaac aagtacaaca ggtcaggaaa ttttacaagt 

4141 gaatgaaata agttaatcac caaaatgaat aaggacattt cccatcagaa aggtcttctg 

4201 aattttaggt gtaatgttaa ttttttgctg tagtttttcc cattgtttga aatttttgcc 

4261 aaaattagtt attgcatacc cttcatgttt ttgaagattg tttaggaatg agaaaacatt 

4321 ttggacgctt ttattattag gacaccaaac actttttgtt gaaaaaacag ctcgtttaaa 

4381 aaaagctttt tccaaaaaat ctgacgcaag gcttgtgaat tttcgttttc ccctgatttt 

4441 taaaatttct cctaaagttt tttgctaata tttttcgcta tcgcgtaatt tactagtgaa 

4501 tcaacaaaaa attttttttt tttcatagat tttttataag tttttgaaaa catagattta 

4561 aaacttaaac ttaaattttg acaaggcgag aggaaaaaat taaaaattgc tgaacattca 

4621 gatgccggtt accttatttt tggttcaaaa atcccaatat tacgcgtctg ggttatagtc 

4 681 atttgccttt attaaattaa tggtgttcct tggaaaagta agttctgttt tgttttcagc 

4741 ttatcacttc atcaaacgga aggaaaggtt gattaaggaa agtaaacata ttttatgttg 

4801 ttcttgtcac ttcctccatt tcgcaataat ataactcgag aaatatagaa ttttgttcga 

4861 agttttcttt ttccttcaac attttaaata ttgttagtat tacccagaaa aatagaaaaa 

4921 atcgaagaaa tttgcaaaaa agcagacgta gaggctacgt acttcttaag cacgcccctt 

4 981 ttcttttaaa tttgttcggt cgtaccgaga tccggtacct tattttacaa cgttttctgt 
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5041 tccaaaaata ataatgtact gcagttgtca caggatttat tcgctgatga tcaagagcta 

5101 attgaatatc caggaattat gaaagacact acaactcaac tggatataac agattctgaa 

5161 gtggagactg ctcaaaaaat ggaaatgatt gaagaaactg aagcagattc gacatttgta 

5221 ggcgaggatt caaaagtgag acaaaatcat tctgacaagg attcctgcga gcactcagtc 

5281 aagagcgagt cacggcaact cggtccaaaa ccatttctaa ttagtaaact ctcaaaaacc 

5341 acaactaaat agcttaaaac ctttgtaaat tagcttattt ttgctaatta gcaatgattt 

5401 taagctaatt agttgtggtt tttgagagtt tactaattag aaatggtttt ggaccgagtt 

5461 gccgtgactc gctcttgact gagcacaagc aaacttttgt ggatgttgag aatcagcggc 

5521 aaagtggcac tactagtgac gaagttgacg cagattctca gattaatttg gtaagacaaa 

5581 gaaaatataa attttattac ccagatgcat attttcatga ttctgatgca aaaaatacgg 

5641 tacccgatct ggatactaca atttttgtaa aatgcgaaaa ggtttgcacc tttaaaaaga 

5701 actgcaattt caaacacttg ttgctgtgga ttgtttatcg gtttttaata ttttttggtg 

5761 agagtaaatg agaaaagcga gttcccgcat tatctgtgtg cgatttggaa tacagtactt 

5821 ttcaaagacg cacaccattt tgcatataac aaacatttgt cgtgtcgaaa ccgggtaccg 

5881 tgattttgca ttaaaagttg caaaatttca catagttttt ataatttagg ctacgaaaac 

5941 tgtgaggaca tccagttcaa gtttcctgtc aactgtttca acatgcgaag cccctgcaaa 

6001 aggacgagca agaatgtatc aaaaagagtt ggaaaagcat gtgattgcat ttactgaggg 

6061 aaatctcaca ctacaaccag atttgaacaa agttgatccc gacagaaact atcgatattg 

6121 tacaattccg aactttccag cttcccaagg aagtacgttg ttcaataaaa catactaggt 

6181 atataattaa ttatttcaga acttcgagaa gataatcgat atggcccaaa aatcgttttg 

6241 cctcaaagat ggcgagaatt tgattcggta catttctatt gaattaatta tatactactt 

6301 actagaaaca ccatggagaa agaatgcaaa aaattgaatt ttaaaaacta attttttaat 

6361 tttggctaaa ttttcagttt gaatttaatc caaaatgaaa actgcgacca atcaatgact 

6421 tttcaaaatc acttttcaac caatcaaacg gagtgtgctg ggctcgaaga cgctgattgg 

6481 ttcggaaatg ggcgtggttt ctcattttgg agggaattca aaaaaaggca tttggtcaca 

6541- gttgaaaatc atgttttcaa aagatgcatt ttttattcct tctcgatttt ttttgatttt 

6601 cttttgtggt atttctgaat ttaaaggtgg tgtagtcgaa tttttttatt gctttattag 

6661 actcaaaatt ttctgaaaac gccaaatttc ataatgaaac ttcttgaaaa ctcttcagca 

6721 aaaagttatg acggctcaaa aaatggccta aaattagtta agattggaga tttgaccgac 

6781 ttgtcaatgt cgcagcggct ggaaacaatt ttttttgaaa tcaccgtcaa attttaagta 

6841 tacaacttga ttattttgcg ttttaaactt tatttaggta tttaaaagtc gatggacggc 

6901 gagttttggc tcaaaaaaat taaaaatctc gccgtccatc gatttttaaa taccttaatc 

6961 aagaataaaa caaaaggtag gcaacttgta tattcaaaat ttgacggtga ttgcaacttt 

7021 taactaattt caggccattt tttgagccgt cataactttt ttctaaaaag ttttcaagaa 

7081 gtttcattat gaaattcggt gttttcagac aattttgagt ctaataagga aataaaaaaa 

7141 attcgagtac accaccttta agaaaatttt ggatttccgc tacgctaatc cacctttaat 

7201 caaaaatatt tgaagttatt caaagttaaa gaattatatt ttcagagggg ccgtagaaga 

7261 gactcatatt tctatttcaa acgtaagctc gatggatatt tgaaatgcta caaaacaact 

7321 ggatatttta tgtttgttgt aagtttttga aatacaattc gtttgaagat ttactctatt 

7381 ttcagggact tttgcacaac atgtgggaat ttgacccaga catcacatat aaactgccag 

7441 cactggaaat gtattacaaa gagatgtcgg aacttgttgg tagagaagag gttttggaaa 

7501 aatttgcacg agttgcccgc atcgcaaaaa ctgctgaaga tattctgcca gaggtatgat 

7561 ttatgagata tacagcattt cctctaatag tattgcatat aaacatttca ctttgaggtt 

7621 atatcttggt ttattttaaa aatatcaata aatacaaaac aatagaaaaa tgataaaaaa 

7681 acattttgtc agttgataat ttgggtatag tattcattca taatttgatt tttttagcga 

7741 atttatcgtc ttgttggtga cgtcgaatca gctaccttga gccacaagca atgtgctgca 

7801 cttgttgcga gaatgttttt tgcccgaccg gacagtcctt tcagtttctg ccggtgagta 

7861 atacaagaat gctcatattt ttagaatcaa tatttgcaag gaactttaat cttacgtacg 

7921 tcttaagatg agcattttcg cacatatctt acgcgcacga gtctcgacac gcgaacatcg 

7981 agcttctgta actcgtatca atttacaagc cgttattaca tcagttttta atgaatttta 

8041 agaaaatcgt gcaaaagtag tgtcgagagc cattcgcgta agatatggtg agatttatca 

8101 tttttagacg tctagtggat atctaacaaa actttataca tttttatttc agaattctct 

8161 cgtctgataa atctatttgt gtggagaaac ttaaattcct gttcacttat ttcgacaaaa 

8221 tgtcaatgga tccaccggat ggtgccgtca gttttagact tacaaaaatg gataaagata 

8281 cgttcaacga agagtggaaa gataaaaaat tacgttctct tcctgaagtt gaattctttg 

8341 atgaaatgct tattgaagac acagctctct gtacacaagt tgattttgcg aacgaacatc 

8401 ttggtggcgg agttttaaat catgggtctg ttcaggtagt tatttaaagg aatataagaa 

84 61 tttgaagttt tatttttttt atgcaggagg agatccgttt cttgatgtgt ccagaaatga 

8521 tggttggaat gttgttgtgc gagaaaatga aacaactgga agcgatttca attgttggag 

8581 cttacgtttt cagttcttat actggttatg gtaagtctag actttcaaaa aaaactgttc 

8641 caatatgtca atatatttca ggtcatactc taaaatgggc agaacttcaa ccaaatcatt 

8701 ctcgtcagaa tacaaacgaa tttcgagatc gttttggacg tcttcgggta gaaactattg 
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8761 

8821 

8881 

8941 

9001 

9061 

9121 

9181 

9241 

9301 

9361 

9421 

9481 

9541 

9601 

9661 

9721 

9781 

9841 

9901 

9961 

10021 

10081 

10141 

10201 

10261 

10321 

10381 

10441 

10501 

10561 

10621 

10681 

10741 

10801 

10861 

10921 

10981 

11041 

11101 

11161 

11221 

11281 

11341 

11401 

11461 

11521 

11581 

11641 

11701 

11761 

11821 

11881 

11941 

12001 

12061 

12121 

12181 

12241 

12301 

12361 

12421 



caatcgatgc 
aagcaaatat 
aattcacaaa 
agccactgaa 
ggattttgct 
gagaaatatt 
cgccaaaatc 
cggagctcaa 
tcaaataggg 
tcttccctat 
t tcaattatc 
ccacttcatt 
gaacgaatga 
t ttgaaaaag 
catgctattc 
cttcgataga 
accttcagta 
cgattaaatt 
acgttttgat 
cccgagatat 
aatacattcc 
gaattaattt 
caataacagc 
taatcgacga 
tcactgaaga 
agaaaagcga 
ca taaaatta 
ggggtactct 
cgggacacat 
gata tacat a 
aatt'ggttgg 
gaaaccaaat 
gaagtttaga 
cttgaagcca 
tcgtaaatcg 
tggcagatct 
ctgaattgaa 
ttcacaaacg 
aagtcgttca 
gcagtggcgg 
ggcttccttt 
atgacttgag 
attaaaaatc 
actttgctga 
attatagctc 
aaaactagtc 
acatttttac 
ttgaaaaatt 
gtattcgtgg 
ttcatgttgc 
aaact acata 
aatcaggcac 
gttctttaat 
cactggacga 
aatgacatta 
cttttcttta 
ctgaaaattc 
taaatacgga 
ttattttttt 
cccttcgatt 
tatttgaaaa 
aatgttcaaa 



aattctgttc 
cattagggaa 
tattccaatt 
gtgtatgtta 
tcttttgcaa 
gaaaaccaaa 
ggaatcagcg 
aaacactgat 
aagttaatct 
ttgtttaaat 
ggtttttttc 
tctgttcatt 
aacagaagga 
ttgtatcgag 
agtatgataa 
atttctactt 
tcccgagcat 
ctgttaaata 
tttaatgttc 
ctagttcgca 
tgtttttcac 
tgaaacagat 
agggcagtta 
gaataaaaag 
gctacatttc 
gaaatcatga 
aagcaccaca 
agttccattc 
ataaaatcta 
tatatatata 
tgtggtggaa 
taatgtataa 
agcacatgct 
atgacacttc 
ttcgactttc 
gaaataaaat 
acagaagaat 
gctgtgtttc 
gcttcctcat 
cagcatacgc 
cattgttgta 
acgattgatg 
aaaaatttgt 
ggcaaccgat 
atgttctcat 
aaaattactg 
tggaaaacta 
gaaggaacac 
tgagacccaa 
caatattggc 
tatttaagct 
gtaatgtgtt 
tttcccattc 
gggcattgga 
ttttttaaaa 
ttagttggct 
aaaaaattag 
tgtaatgggc 
ccgaataaaa 
agtagataga 
atcccagaga 
agtttggttt 



aaaggatcaa 
atgaagaaag 
gttactggat 
tttcattcgt 
aatggcctcc 
caacgaattt 
attcgctcca 
tggctagaaa 
aaattaaaac 
ttttcttttt 
agtcataatc 
tggagaaccc 
cgtaacactt 
ttggaaacag 
acaacaccgg 
atctcagtag 
aattcgaatc 
tgcatgctcc 
atattattat 
agtcagaaat 
aagtgtagtg 
ttcagagaca 
ttatgattat 
agatgagaag 
caacagaata 
ctttcgagat 
gactgtccaa 
aaaaacttct 
agcatgcttt 
tatttcatct 
gttatgaaaa 
tcgacgagaa 
atcattcatg 
cattgctcct 
gaatatccac 
cttttcataa 
aaatctcacc 
ctcatagaga 
gtgctgttca 
tgatt tagat 
atgcggttgc 
atgcgatggt 
gtggttccca 
aactatttcc 
gattgagttg 
agtttctcct 
taataaatta 
agaacatttt 
ctccctcata 
caattaattt 
ttcagcattt 
ttcgaggacg 
caatcgattt 
atgggagatg 
atgttaagat 
tagccttgcg 
atattcaaaa 
aacccattct 
tgatctactt 
aaagaaaaag 
cataaaaaat 
caatcagcac 



aattagattg 
catctatcgg 
ggtggggatg 
taaatattga 
ctatgtacct 
ttcacaattt 
cccatttttc 
gtgggcgtag 
aatctcgtta 
aaagatcgt c 
caagtaattg 
gagcttgccg 
ggtaagtcat 
cttttaatct 
cttgccacat 
ttcggaagat 
gcccacacgg 
gtctttaaaa 
ccgaaattag 
ttttcggagc 
tgaaaccaat 
gtgaggttga 
ggatgctgat 
ataggcgaga 
agaaatgtag 
aaagagattt 
attataaaat 
tgcaaacaaa 
gaaaagcgga 
catctagagg 
gagcaatttt 
tcatgatgag 
ctcacgatga 
tctcttgcgc 
atcttaccgg 
tttaaagtct 
tgaatatcca 
tcatctgaac 
tttgccggtc 
gctcgcatag 
atttggaacg 
ggaggaatgt 
ttttaaaata 
tcgctggcga 
acctgaacga 
tccacgtcgt 
caaaaatcac 
ttcaatgcgt 
aaagcatgcg 
caagagact c 
ttttcaatgc 
actataaatt 
tttctcccac 
attcatgtga 
gatatgctta 
gccacgttta 
aggtataaat 
atacaggaaa 
tgttttatgg 
gaatgtacga 
cacacagaaa 
taataatgtg 



tcagacggag 
attcatgagc 
tggagcattt 
agatggagga 
gaaaaaaaaa 
tgcctaaatt 
cgccaat ca t 
cttcttattt 
aaaaatgttt 
taaaagctac 
ctgccggagt 
caaagtgcaa 
gtttttcaat 
aaattctgct 
aagcactttg 
gttgagtctt 
ccataaagac 
aatcagtccc 
tatactcgcc 
atcgtcgtga 
ccatgcagac 
cattagatat 
ataggaaagt 
ataaagaacg 
ttggaaatcc 
at ctgcaaac 
cagtttctcg 
gagaaataaa 
gaacatacga 
atcaccatcg 
aaccgaaaat 
atgatgattt 
cgataggttc 
tcacacaatt 
gcggcacttt 
ga ta t cccga 
cgtttgcaag 
cgtaggagaa 
tcacgtgctt 
tgtcattcct 
gtgcgttttg 
ctgcaagat t 
aaaaaaaaat 
ctacttctct 
tcaaggttaa 
ctgtcgagca 
gccgaaaatg 
ctctcacgtt 
cct ttagttt 
tgattgaaag 
acttgagacg 
gtacctttgc 
tctggcagtt 
caa tccacac 
ccaggagtaa 
ccatttgaca 
ataatataaa 
accaaaaaat 
tgccgctcta 
gaatatcgtt 
agggaaacag 
aaaggtaacc 



cagttaaaca 
cagggaccga 
aatggggaca 
gagtgaatgg 
tgaaaaaatc 
tttgaatttt 
ttataatgtg 
cggaggaaat 
cttttttcaa 
cagtatctga 
cgctgatcgt 
gaaaattata 
cttcagttga 
aacttacagg 
aattctacgt 
cgaaatcatc 
cggttccttt 
cgtattttaa 
gtcatgaaag 
tatatgaata 
gtttatttct 
gggcaagtaa 
cagaacagta 
ttaacgaaaa 
ctaatcaaac 
aattcttgaa 
ctacagtctg 
cagacttgta 
tctattcggg 
ttactcatca 
caccaaaaca 
gcttctagca 
gttatgcatt 
tccattctcg 
tggcctggtt 
gaaacaatag 
tgaagtttct 
tggacggagc 
ctttacggga 
acgactttga 
aaa t cctcgc 
atagttatgt 
atttttacgc 
tattgtgagc 
aactaggcct 
ggctccgagt 
gggaaaagaa 
cgagactact 
tttaatttaa 
tgttataatt 
caaattgaat 
tatccagtgg 
tctttgtcat 
atcctgcaat 
atatcatatc 
ttatagttac 
tgcgatttgg 
tcccgcaaaa 
tgtgttatga 
tattatttat 
tatttctgac 
gtatcaatag 
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12481 tgatattttc ttattaaaaa actgttcgag actacaagaa ggcctgaaaa agcccgcaac 
12541 gacgactaaa ttcgaaattt cgaattaggt tttaaagatc agaagatcgg cagaaaagta 
12601 tctgataaaa atataagaaa tcggaatagg aatgcgatga ggaggtagaa atatggtgaa 
12661 gagatacaga agaatgaggt aagatcggat gaacttgaag cactttttga gatttttgat 
12721 ggtgaagttg gtggatgtag acgtttcatg gaacatctga aaattaagat ttttctaaaa 
12781 cacattttct atagaatata atagaatgcc aaatagagaa actagactta cttgaatttc 
12841 tttcgatttc tgtctttcaa ccttctaact gaaatcaact ttcgacgtgt tctcggtgtt 
12901 tcaacaacac catcaacaga acactcagca ccaaattcag catcggaatc atcagaagaa 
12961 gactcatcgg aatccaaata gaaattggat ttagtattca tcaattcaaa agaatccaat 
13021 gatactgtcg attcagcaag ttggactgaa cttgatggtt gactacgaac ccattgaggg 
13081 cgtcgaggca gaagtcgaga gtatgaggat gcaacgtgga ttgatgatga cgtcaacaat 
13141 ctttggtgtt gagatgaaga agtggctgat gcagatgttg acagacggaa tggagatgag 
13201 tgaagagcaa gaagacatct gaaaatttga aacgttgttt atgtggacag tactgtaaag 
13261 atcttacctt ggatcataac tacttgccct ctgttttctc ttctcttgac ttctacttaa 
13321 aagcatttcc gtctcgattc tccggttact tgaaaatcca actccagaat tttcagcaca 
13381 aagctgctct cccgaaccgt agactgttgc accacgttga ggggttgaca aggatctgaa 
13441 atcagatgtt taaagcatgg caagtagagc aacaatgtta accaaaattt ctgaaacttt 
13501 ttcgaatata gtcaaaaatt gacaataact cagtttcacc tatcatagtt ttggaagtca 
13561 accaaaaatt tttgaaattt cataaaaatt ccaaactttc taaaaatttg gaagattgat 
13621 atgattgata tgaaagtatt tatatatttt ttaacctggc agacgatact tcaccattaa 
p 13681 agacacacat gtggagaaga attattttac ttttagtaat ccaacgtttg cacttacctt 

%S 13741 ggagcatgca agcttttagt cattaaagct ggaattctag atggagttct tcttggtgtc 

fit 13801 gacattgttg aaataaacat tcgtggttca ttgattgatg atgacgtcat agaaccacgc 

't\ 13861 ccagatgaca atggattacg gtagtcatca gaatcagtag attcattcaa ttttctagtc 

13921 atttcttctg ttttctggaa aattaaattt taattaaaga tctaacaaaa atctggcact 
^ 13981 tacattaata agataatcaa catattctaa ctcattcatc gtttcattat tttctaattc 

HP 1404 1 tggcttcttc tcatcgaacc gttcggtggc attgtgtcgt tgcgggcttg accgtttttt 

III 14101 gaatttctga aatgtttttc atgcaatttt tgttcttatt tgtgtgtcat atacagtgaa 

k& 14161 aatcaaaaac tagtacaaac taattccgtt tagtaaataa aaaatcgatg taaaatctca 

1 14221 gcaaagccaa gatcttggcg ggtccttata tccaagtttt gttgccattt tatttcagat 

14281 attcttttcg aaagtcagaa aatttgaatt tagaatcgaa tggacccatt tcttgttttt 
^ 14341 ttttgttgca ttttttaact gtactttttt cgtcagcata tattttcact attaaaacag 

S 14401 aatattcatg acaataattc cacaaaaaaa cgtactttaa tatcatagtc gattggttca 

O 144 61 gaattggaac gagaaccttc gacgcgtcga ttgtcagatt ctcgattgat ggacgacgtg 

feO 14521 ctgactgaaa atttctggat tgaaaaaata ttcaaatgaa aaaataaatg agaaactcaa 

A 14581 agtctaaaaa atgaatgttg ttaataacga atatttctga tgagaagagg atagagaaaa 

JT 14 641 aaaacgagtc taataaaatg catg'tgatat cctgcataaa aatcccttct tttttcacta 

14701 atccttcgct caattcattc aaatagaact ttgatttcta ttagagttga ggttgtttga 
14761 acaattttaa taaattaaca ataagccata aaacctcgaa acgtaccatc atcattgagt 
14821 ttgaaaaagt ggacggatcc gagtcagtca cctctggaac aaatcgttcc agagcactga 
14881 aaacgacaac gttctcccca cagaatcgga ttgtctcctc gggaattgtc gcctcgacaa 
14 94 1 acgatcctga acctgaaaat tttcgatttt tgtaagctca atggatttta aactgaaaat 
15001 gtagtcaaga agtcaagaaa aactgatgga gttctaaatt cggtgttagg ttttgaaaag 
15061 atcgtcaaac aaacaaatgc ataaaactag gtagggaaca aatagtgaaa tagaaaaatg 
15121 aaaggcgaca actgccggga gcaagagtac acacaaagaa aaaaagttgc ggaagagcac 
15181 agagagcgtc agtccatcag aactgcatag ataaatagat aaagagaaac atgaaacata 
15241 aggccacccg ggagagacga caggccagtt ttccggtgaa gatgagagtg cgagaattag 
15301 ataagaaaac ggaaattgtg atgaaacttt ttcaatccaa acttctagaa ttataagaga 
15361 cacctaaagt aattagataa gtgttttaag tgatatttta gattcactgt atcatgttta 
15421 aaaaagatat ttcaaaaata tatacctgat ataggaggcc tcctctgagc accgaattgt 
15481 tctcgagctg tttccacgag catccgctca cacattgaca taggccgtcg acagccagga 
15541 gttgccacct gaataaataa ttattcaatt taaacctaat ttagtaatgg taactttgta 
15601 aatgatggtg gatagctcat ataaaatttg aattggttct aaagttatac aaattttaat 
15661 ttcggtcaaa cttatgaact gtacttttga gttatactat tacaataata ttacccaaat 
15721 tattgtattc agatttttgt aatcagtact aacagatttt aggcaacgtc ctgccagaaa 
15781 catgggaata tatttgagca gtttttagta agttgccaca gcttgtataa gggaattgta 
15841 tcaaaatgta cttaatactt tctaagcact gacatagtga actacaaaag tcggtattat 
15901 acaatgccac tacaaataaa aatattcaga attcgactga aaaatgagaa aaggaacaac 
15961 tgaattggac acacgatgtc gtgattttca agaacacaaa aaaaagaaaa agaaatcgaa 
16021 aatgttgttt gcctcttttt ctttttctat atgagctaga atctcgaatg catgcctaat 
16081 ggagccactc gctctcgttt ctctaagtct cttctcacca gtcttttgtc caaaaattgc 
16141 gatgtcgcag gcgtcccgtt tccgccgctc acggagacac cactatcggc accagatgat 
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16201 cgtgtaaaga caccgtcttc gttgacttca attgctgaaa taagaggaat tagttttgaa 

16261 ttggaaatct gattaaataa aagtccccta ttcaatctaa ttaattttta aacacaaaac 

16321 ttactattgt ttaatggtgt tgacgaatta gaagaagttg attgaaatgc gttaacattc 

16381 cattcaaagc tatcttcatt ccattctgct cgttctttta tcctttcact cacgtctcga 

16441 gggatgaagt tttcaacaat aagaagcctg aaaacttata ttattctatt aaaaaaaatg 

16501 aacatcaaat cctaacgaaa gaaaattctt ctggggggaa aaggagagaa ttgtgagaat 

16561 aaagaacctg cgctgtcggt atcaaattac actatttgaa ttcaaattag aatacgaaag 

16621 aaagtgaaag aaatgaaaat gagtgagaat ctattaaatt gtaattgaga tatcactgaa 

16681 cttacttcaa cttcaattcc ttgctcactt ccgcaattgt ctgatccaga tcttgtcgtt 

16741 cgtcggaata tgctccagac acatcacgaa tctcattgcg agcctgaaac attcacaaac 

16801 cttatcttga cacctggtac atctgaagtc aaacctgtct caactttatc aacatcttct 

16861 ttaacttttt tgtctttgcc tccacctctg tccttaaatc cgaaaatgtt tgtttcaaat 

16921 ccacagtatc ctcttcctgc cgttccaatg cttcaaccat ttctctttcc cttcgttttt 

16981 gttcggcaag ttctcttctc ttcttttcca gttgtgcatg ttgttctttt gtccttgatt 

17041 ctaaccttcc atcttcttca gatcctacga taagtcgact ttgaatgttt gctattcttt 

17101 cagcaacccg tgcttgttct atccgttcct tttctaacag atcatgcttc tcttgaattt 

17161 ctcttattaa tcgatccttt tcatgtttta tcagtgaatc atctttttga attgcttcaa 

17221 tatcatcttc aagttttgct cgttcagcat cataaaaact ttgagttgct ccatccctcg 

17281 atcttgtctt tctctgcttc agttgctcac ggagcatttc aatttcttct tggaattctc 

17341 gcagtaaagc atccttagga tcttcattaa ttttcggttg attcttgatg tttttagctc 

17401 gatttgcata tcgtaatgta ccaagtgtct cctcaaaatt gtaacttgca ggtccaatac 

17461 aagcaaccat aactgtcttt gaatttccac cgagagaatc ttgaagaagt cgagtcagtt 

17521 ttgaatctcg ataaggaata tgggcagatt tcgcatccac caatgcactg attacatttc 

17581 caagagccga taatgaaaga ttgattttcg tagcttcttt aaatctttcg ccagttgctc 

1764 1 ctgttttcga ttgccgttct gaaccagcta aatctacaag atttagtcga ccaactgtaa 

17701 tatgactttc tccgtcttca ccaattcggg aacattcaac agtaatgata aagatagcgt 

17761 gggaacgaga cgaatgctca ttcatgttgg ttcgcctgaa aattttagta aaatcaaatc 

17821 caacggcgac cacagaaata acttacccta cagaccgatg cccatttcct cgaatcatca 

17881 cttcgtgtat ttcacctact gtccttgtta attttgactg aaactttgaa atttatagtc 

17 94 1 t gtcttctatt tcagaaaact atcacttacc gttaaatctt tcacataaac tcctccatct 

18001 ggacgttctt taatttctaa tttcttattc gattcggctt ctaataaatc tcgaagttcc 

18061 tcctaaagat ttcatttttg taaatcacac atcctaacgc cttacctgat aaatttccaa 

18121 atagctagct ctaactaaat actcttgatt atgtgatgct gccatgtgct caaaaatatg 

18181 gtcaatacac ttatagatga cacctcgttg ttctggatcc gatgattttc cttccattgt 

18241 gtgagtcttt ccagttccag tttgaccata tgcaaaaatc gtggcattat atccgtttag 

18301 aaccgaatca actagatctc gaaaggtttc ttcatataga tccgattgtg tggaactata 

18361 aaatatatat ttttaaaaaa gagaactcat aaaatcataa acataaaatt gtggagaaat 

18421 aattttgaaa aatactaata tttctatagc aggtgaaaaa aaagtgatgt actcctagaa 

18481 ataaataatc ttacttttca tcataaattg catcgaatgt aaaatccttc gatggctcat 

18541 cttgctcttt tggatttttc agctcaattt gcccacgttg tggtcgcata tgtactattc 

18601 tgaaaatgat cgaaatttca aaatataaat atttcaaact ttacttactt tgaataatta 

18661 tttgcaattt cttgtgaaga taacggtcga catctcacaa ttacctgtaa acataaataa 

18721 atacatttat ttgaatttgg aaatgtataa aactggatta tgaaattttt aagctggtgg 

18781 tttttgtatg agaagtaacg aaaaaaagta caatttactt agagtcttgt gatttttctt 

18841 tcaaatgcaa aactcaactg aatcataaat agtgatgctt cgaaaagttt ttagaggaaa 

18901 attgtatttt tagtaaaaac taatatacgt tttggactta aaaaaaaatt atgttaaaac 

18961 ttgaaaaatt acgtttatta gtgcttatat taaaatacgg tttcaaatta atttaaaatt 

19021 aaaataactc accttttggt caaaatcaga cattttagaa actagcatgt actttattac 

19081 gttgaatata acttatgttg gaaaatggaa aatttgaaga caggtgaatt ttagtttttt 

19141 ttctttttcg tacttctaaa aaatacttca tttattttac attttgagaa ctaatttttg 

19201 aacatgtttc gaacaaaaaa aaagattttg aaaaccccaa aaaaacttac tttgacagtc 

19261 tcctgttttg aagatttttt cattatttcc accatttttt gtcactaaat atttggctat 

19321 caatgtaggt gtcaaggaaa attttggtgc attcctgatt tagtgagagt ggtctggaac 

19381 ttaagaagat tagtttaatg tggaaaaata atcatattgt atcgagaaac ggaattttga 

19441 agcaataacc gctagagaaa gtgactaaaa accagaaatt gtagtcgtgg aatttcaata 

19501 tttttggttt tatgtcacat ctggacaatc ggaaaaatat gcatacattt gaaattttta 

19561 gaaatatttt gaattaactt taaaggaaaa aaatgcatta aaaagattga aaacatcatt 

19621 gacgttgaaa aatggagaaa atttctaatt tctcatcaaa atattaaaat attaaagttc 

19681 ttcaataata tgaaaatgtg aataaaatgt ctaaataagc aaaaaaaaca gatcctattc 

19741 attataaaat gttcacacaa gtgttacatt tcgtacaaag aagtactaaa acggatggac 

19801 taaagtaata ttgtcactcc cgaaaagacg aggaagaagt aatcggaaga agatgtcgga 

19861 agatgagtga tagtaaaaat acgaagagac gcagatagag agtttgagag aaggagactt 
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19921 
19981 
20041 
20101 
20161 
20221 
20281 
20341 
20401 
20461 
20521 
20581 
20641 
20701 
20761 
20821 
20881 
20941 
21001 
21061 
21121 
21181 
21241 
21301 
21361 
21421 
21481 
21541 
21601 
21661 
21721 
21781 
21841 
21901 
21961 
22021 
22081 
22141 
22201 
22261 
22321 
22381 
22441 
22501 
22561 
22621 
22681 
22741 
22801 
22861 
22921 
22981 
23041 
23101 
23161 
23221 
23281 
23341 
23401 
23461 
23521 
23581 



ctggaggaat 

caaaatgtgc 

gcatatgttt 

tatacatttt 

tcatagacat 

taccataaca 

ttgaaaaaat 

attagtagca 

taaggcatgc 

tattaattca 

aaaaaaaaga 

gaacaatttt 

agagaatcag 

cagtgaagca 

acaaatcatg 

ttgcttaggc 

ggaacagttc 

aaaaagtaga 

tcatcatatc 

aaattaaaaa 

gggaaaaaaa 

agaaaatcaa 

aatctcgatt 

tttgtagtat 

attcttccat 

tcccctttta 

tcaactgata 

ttttggaatc 

gatttttttg 

agtttttcac 

ttgacaaatc 

cagttaaatt 

caactgcttc 

acacctttca 

ttcttcaata 

cattgtatag 

ttaaacctcc 

taaaattatg 

tgccacgaca 

aagttacttt 

gcct cat t at 

ggcaatggga 

cccaaaaaca 

acaaatttct 

aaaactgtaa 

ttttaccgtg 

ggatcaattt 

taggccactc 

tgaaaacagg 

attgtgtaca 

gaaatgggaa 

gtaagttgag 

aattttggat 

gttggaagtg 

aagcctaggt 

ttactatgat 

attaatgttc 

aaaacaatcg 

aaactgacgg 

cgccgtgtgg 

ttcagtattg 

ttcagatacc 



aaaaggtggt 
ataatatcga 
ctagtgaaca 
aaacttgacg 
gtgtagattt 
ttgttgattt 
tcacagagaa 
tggcttctat 
gccagggcct 
agagactttt 
ttatttgcaa 
agacaggctg 
taaaatgttg 
gaaaagtgcg 
tgactggata 
caggctctag 
tctttgcact 
attttaaatt 
atgtgatcat 
tttactagaa 
caagaaaatt 
atcaaaatca 
ttttaagttg 
ttcagcctag 
aatttaatat 
aaatcgattc 
gtgagcagac 
taacatattc 
tcataagtat 
tatttttttt 
aactagtttt 
tttttaacga 
tttaaattct 
aaaaa tattt 
ttttcttctc 
actcaatttt 
attttgcaat 
ctgttttttt 
cagaggaaag 
tttaattctt 
aacataactt 
attgctcttt 
aaaatatcaa 
tgtcaaaatt 
gaaaatcgat 
ttagtaccgg 
tgagcaattt 
attcgtgttc 
gatgtagtcc 
cggattgtgc 
atcaaacatt 
ttttaatttt 
ctggtcttag 
agtgtatctg 
tgaacctgct 
atttcaaagc 
aacattacag 
attatacttc 
atgtgatcca 
gaaagtggct 
caacattttt 
ttcgaattat 



ttcaagatgg 

tcctgcgcag 

ctcagaagta 

tttttgaatt 

taaattaaaa 

atttaaaaat 

gttataatgt 

gtatatagac 

gaaaacgcat 

gaaacttgat 

aaattacaaa 

caatgaaagc 

cagattattt 

acaggagacc 

taattgaagg 

atattttatt 

ttctctatgg 

ataattttaa 

atcatataat 

ccaagcatat 

tcattaacat 

agtgagatta 

gattcttaca 

acggctgaga 

aattttcatt 

tccttttcaa 

accaacaacc 

aagaaaatta 

ttcttatttg 

tgagttctaa 

ttactcatat 

aattaaaatg 

ttaaaattca 

ctacattatt 

tgcgtt'tccc 

actttcgttt 

attaaaaatt 

ttcagaaagc 

aaaaatcagc 

tagacacacc 

cagcaattcc 

atttcggaca 

ttactctttc 

ttacattgta 

aattttgcta 

aa tgttctgc 

gttgtgaatc 

gtaattttcg 

aagtagtcaa 

tctttcaaat 

taaatgtgca 

taaatcatca 

gatgaaacga 

gtaaaaacac 

ctgcagttcc 

ccggggtacc 

ggcgaagtga 

aactcacgaa 

ttagaatatg 

gattcaatgt 

cattttttat 

tttatatcaa 



gggacagaga 

ttgagagacg 

gttgttcatg 

ataaagggat 

cacaactaga 

gagaaaaagt 

ttgatgatat 

tatttttata 

ctacctacca 

tttttgagat 

ttttaatgtc 

aatgaaataa 

gaaaatgcat 

gaagtctaaa 

tcttgattcg 

gaagcttttc 

ctcaactacc 

aatttaaata 

catatagggc 

gacaataaaa 

tattgaaaac 

ggaaagaatc 

cgattttttc 

attcttttca 

cgatattagc 

ctgactcatc 

atctctttag 

acttgaaatt 

gattataatt 

acacttcctt 

ctcacatcaa 

gtcatcggac 

atcaactttt 

tgcccacatc 

acactcttat 

t t t caat t t t 

tcaatattgc 

accaaaacat 

cgaaaggttt 

ttggaaacaa 

agttactctg 

taatggatgt 

ctggtaataa 

aaccgatagc 

caat ttcaca 

acttgagcct 

tgacaataat 

ttttccttga 

atattgattc 

ggaacacaag 

tatacaatta 

aagaaaacat 

cartgtaaca 

aattcgaaaa 

taatttttca 

atcttaaaat 

agttttatta 

acgatcaaca 

tggatgttaa 

ttaacggtgc 

aatacatcta 

tgataaagcc 



gggagagggt 

cagacaatgt 

tgtccgaaac 

ggaggtgctt 

cataggatga 

aaaattccgg 

tcactgattt 

tcacatacat 

ggggagctct 

tttattcaat 

tatactctga 

aatttccttg 

gcaagaattc 

aaagtgaatt 

gaaaagataa 

agaaatgttc 

agggcttttc 

ccaagcaaaa 

tcgttctttt 

tattttgaat 

atcgttggca 

gaaattaggt 

gggatatttt 

aacct tccaa 

at cca ttata 

acttaagaat 

tttccgttcc 

agaataaatg 

ttcatctcga 

cctcatcgat 

tctatgattt 

cgagcaaaag 

cgtgtccaaa 

ttggcacagt 

tttctgactg 

t ttttctgcg 

ccgttttggc 

gccaga tgat 

gaaaaattta 

caaaaactgc 

ataacaggag 

gagtttttag 

gtaagaaaaa 

aacaaaaaac 

aagctaaaaa 

tactattagt 

tagtcctatt 

acttgtaaag 

ttgtagcatt 

cttcacgaat 

ctttgaatga 

atgtatattt 

ttttgattaa 

tatttaacca 

cattattttt 

catcatttgc 

cggtctttcc 

gctacgtgga 

tggaactaaa 

atttcaattg 

acttcaaaat 

tcaaacgcgg 



taaaagagca 

gaagaatgga 

tttggaaaca 

caaaaagtaa 

atcagaagct 

atagtcttct 

gtaatacatt 

gaaaaagggt 

agctcttagt 

gattggttta 

catcggttta 

aaattataat 

gcagaaaat t 

atgaataaaa 

ttggagctct 

aaaattatca 

ctttttcttc 

aatcatatac 

ttttttcaaa 

tcactttaat 

ataggaatgt 

agaattggaa 

tcatttttat 

tttcaaagag 

tacgtatgat 

tgttgagtca 

gtttatttta 

tttcttgcta 

aatcgtagag 

gatgaagttt 

tattcaaaaa 

ctttcagaat 

gtcacaaact 

tttcttgcca 

ttgacttttc 

aagttcggtg 

ttgaatctat 

attccaaaat 

gaaaaatctg 

ctgctttacg 

tagccacgtt 

agtttattat 

gctaaagaaa 

aagtgtcata 

atatttttta 

tacacaaaat 

gatatagctt 

gtacagtttt 

agaacaagag 

tatgagaact 

cgattatact 

ttgcgaagga 

agagccctta 

aatatgtata 

cttcaaaata 

aagtatcaca 

aagttctatc 

aaagttactg 

gttcccattg 

attgcttaat 

ttgttttttt 

taacacgggt 
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hi 

I 



23641 
23701 
23761 
23821 
23881 
23941 
24001 
24061 
24121 
24181 
24241 
24301 
24361 
24421 
24481 
24541 
24601 
24661 
24721 
24781 
24841 
24901 
24961 
25021 
25081 
25141 
25201 
25261 
25321 
25381 
25441 
25501 
25561 
25621 
25681 
25741 
25801 
25861 
25921 
25981 
26041 
26101 
26161 
26221 
26281 
26341 
26401 
26461 
26521 
26581 
26641 
26701 
26761 
26821 
26881 
26941 
27001 
27061 
27121 
27181 
27241 
27301 



tccatggaca 
gattcgagcg 
aaaataaatt 
tctgtcaatt 
attttatggt 
tgaatcgaca 
ttaattacag 
aaaagttcac 
cagataaaca 
atatttcttt 
agcgaaattg 
cttgacaata 
atttttgatc 
actatccagt 
ttggacccag 
tcgttactat 
catgaattgg 
tgaagtctca 
ttgtaaataa 
ttgaaataca 
tgagattaat 
ttgtaagtgc 
aatttaaaaa 
agtaaagcta 
gatttcaacg 
tcaacagttg 
tgttttgaac 
tctcatctac 
acgatttttg 
gttgaatctc 
gctcagtgat 
taacattttc 
gcaaccattg 
caagactttc 
tgacctcctg 
tttcttgttc 
ttttaagtgt 
tcacatgact 
cagagacct t 
tcatgttctc 
tgattgtaac 
aattgttgaa 
cattccgatc 
ttaaattctg 
gataaacatt 
aaaacagaat 
tttccaaaga 
caacttcgtc 
gtataggatc 
agtggaaaat 
ccatcaactt 
ccggatatgg 
ttgagaaatc 
cttttatttg 
cgattaat t t 
cattttgaaa 
gagtcgaaat 
ctggagtcca 
attaagtaac 
actttgaatt 
ataaaaatta 
cacttcaact 



actcgtggag 
gaaaaccaga 
aaactccatc 
gggactaaac 
ttggatgaag 
agttgatatg 
tatgtttatg 
aataattcca 
gtggctccaa 
attatggttc 
tcaaaatttt 
gtaaagaaaa 
aatttgaaaa 
ggatatgtat 
gaatctgttt 
tctcttcata 
aaaaactaat 
aaataagctg 
aataaaatgt 
taataccccc 
tgtcgagtga 
tatcagatat 
atttatcaaa 
atttctatta 
ggaagaaggc 
aatataagta 
aatatcgaga 
agcttccaat 
aatttcctgg 
tttaaattgc 
ttctcgagtt 
ttcaagtttt 
cgggtcattt 
tgtctgttca 
gtttgcctct 
caaatgggga 
cagttctcga 
ctttaatttg 
cacttgacaa 
attcgaatca 
attttgcttt 
attttcgagc 
attatttagt 
gtgttgaagt 
gaaggat tct 
attctttctt 
tgtaaccagc 
ttcaagtttt 
cttgaaataa 
gtcataataa 
ctcttgtttc 
cgacaaatca 
ttcatttctc 
gaaaatttcg 
ctcaacttct 
atcttctagt 
ttcagtgttc 
aaacttgtct 
atctttaaga 
atcaatctga 
cagcttttga 
ccattctctt 



tactcggtgc 
cattatgtga 
ttttagggaa 
agtattgatc 
gttgcagctc 
ttcagtaatg 
ttaatgttga 
cccaaacctg 
aaaaccaatc 
aaacttttga 
tcaaaaccaa 
aacaagttga 
taatcaatag 
tcgggcgaca 
ctaccagtaa 
ttgatttggt 
tgaaaataga 
gtagcatgta 
tattatatta 
ttcagtttat 
gggcatctga 
aacaaatgat 
aaacaaaaaa 
aagttgatct 
caaatcagca 
gaaactctac 
tcgctccatc 
tctccaatta 
atatttgctt 
tctgaaatct 
ctttctttca 
tcaatctttc 
cgcatagttt 
agtttcagtt 
tcggaaggat 
tatgttctac 
acattacgct 
acaatctcat 
tcttttgtct 
atcgtttttt 
gaaatttcac 
atcaaacttt 
tcagttgata 
tccaaaaatt 
tgaagcattt 
ttttggaatt 
tgcgatggat 
tgaatgatct 
agagaaaacg 
tctccatcat 
ggtgtttcat 
taaaattgtt 
attatcaaca 
aattgcattt 
tcttttaatt 
gttgcaaact 
ttgtttattc 
acttccaaaa 
tattcaggtg 
attcttctgc 
aaccaatgaa 
ccaccgctgc 



aactgaaatg 
tgtgtttgcg 
caatgcctcc 
cagatgttgg 
ttccaaaatt 
gattacctaa 
atttatgtat 
cttaaatatg 
tttgttataa 
gaaaaaaggg 
atttgatttt 
atttttctat 
acaatttttc 
agtacttcgt 
tctatttggt 
taaaacagag 
cggatgaact 
ttgtacggga 
gtctgtaatt 
caattaaagc 
aatgtaaatt 
catttaatta 
aaacggtcaa 
aaatactggt 
agtgtacatt 
ctattgctaa 
catcggataa 
tctgatcctt 
cataaactct 
tcttctcaag 
taatcatttc 
gtacataatc 
tgagtcctcc 
tttctaatgt 
cagaaatttt 
tagtcccttg 
ccaatttcgg 
cattcttatc 
tccatgacaa 
tcatatcttt 
gtttcacaga 
caacaccttc 
tagaattggt 
t ctccatgac 
taatctcttc 
ccactgtcac 
tcaaactttc 
gtttgttgtc 
tgactatgat 
ttaatacttt 
caccaattaa 
tttgaagatc 
acttcttcaa 
cagctagtat 
ttttgatatt 
gatattcagt 
gttttacttc 
aatgtgtttt 
cacttacatt 
ggttccagtg 
acgaaacaac 
ttcttttttc 



aaaagaaaat 
gttgaaatga 
gccatcatgg 
cattggtttc 
cagaaaactg 
aggacaatat 
t tatgcaaaa 
gagatgcaag 
aacctcacaa 
aaatttagaa 
ccagatttat 
atgaattctt 
tccatactac 
tatagccaat 
tgttggaaca 
actgtcgagg 
tcaaatttgt 
acagatttgt 
ttatgtatag 
tccaactatc 
taaaattaca 
aggaggaaaa 
atattttttc 
tgtgtaggca 
gtgttctgaa 
catttattgc 
t tccgtatga 
cagtttcagc 
cgagattt ca 
aactcttttc 
agtattcatt 
acagaaatga 
gggttgttca 
ttgctcaagt 
ttcagaatgg 
actagaagtc 
agaatttcca 
aaagatttga 
ttcatctgcc 
tatcatgtct 
attcaaagca 
ggatttatca 
atcatcaact 
tttttcaata 
attctttttc 
taagagataa 
caagtttttc 
ttccatcttt 
cttgtcaata 
taatttgtcc 
tattccaccg 
ctcatattta 
tcttctagct 
tatctcatct 
ttgatctttt 
ggatgtactc 
ggagcttctg 
tttgcttctg 
tttgaaatat 
aagcaagcat 
tattgtattt 
acacttttcc 



tcagaaatcc 
aataagaaaa 
agatatccga 
gagaacattg 
tatagaatac 
cagttgacca 
aatttactgc 
ttttttgttt 
aaatttctcg 
aattctttca 
tttttgtcga 
atagctgaac 
tgattttcag 
gaaaactggg 
ttcttacttc 
gt t cat ccaa 
ttacaagagt 
atactttgct 
ttcaatttaa 
attcgctggt 
aaataaataa 
acaaaacatt 
aaacaaacaa 
tactatagtt 
aaattgaaat 
aattcttctg 
tttgatgaca 
ttaatatcaa 
gatttaactt 
tgataatata 
tgttcttctt 
tcaaccacct 
agaattgcga 
gggaattctc 
aacgtcaaaa 
tcactagaga 
gattcagtgc 
ttttccagtg 
aacatcttga 
tcatcagttt 
attttcaaag 
ttcttggcaa 
gagaatactg 
ctatccttct 
ttgagttcgg 
tcatctttgt 
gtaagatagt 
tttgttataa 
gtttccagca 
aaaagttgat 
taaccgttaa 
gttttgagaa 
tctgtctctg 
tcttgttcat 
tctgccatag 
gtggattctt 
actattttct 
aaaaacatat 
ttggtgacaa 
aattctgaaa 
aaaaaatgct 
agtttatcaa 
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27361 
27421 
27481 
27541 
27601 
27661 
27721 
27781 
27841 
27901 
27961 
28021 
28081 
28141 
28201 
28261 
28321 
28381 
28441 
28501 
28561 
28621 
28681 
28741 
28801 
28861 
28921 
28981 
29041 
29101 
29161 
29221 
29281 
29341 
29401 
29461 
29521 
29581 
29641 
29701 
29761 



ttaaaaattc 
tgaaaatagt 
aaaaacaagt 
gaaaaataat 
tcgttattgg 
ccgagagtaa 
ttaactttta 
gtacataaca 
aaattagcga 
tttcacttgt 
ttctagctaa 
attgaaatt t 
tcagtcaatt 
gcaaattttg 
aa taaccaca 
gacagtggct 
aagttgggaa 
tataagtcga 
attaacattt 
ttttatcttt 
agatgttagg 
gttctttttt 
t ccgaa ttga 
catcgagttc 
gctctgaaaa 
taatactaac 
agaaacagaa 
gtaaaatgtt 
atgaagttta 
tttttaaaaa 
ccaatacgaa 
gaaaaatcgg 
tctgaatctt 
gcttctcctt 
aaatccttct 
tgaataagtt 
t ttgcaatct 
aa tgct taca 
tcaaattgcc 
gatgtattaa 
acggtcgagt 



aagtttctgt 
taatttaaat 
actaagagaa 
gcaatagcaa 
gatcatcata 
aggcaaaaaa 
taacatctag 
gctagctgaa 
ttctgagcta 
acataacagc 
attagcga tt 
tcacttgtac 
tcagctaaat 
atggtttttc 
agctgaaact 
tggaattaag 
gaatgtaggt 
actctttcgt 
aaggagcaaa 
tgtaagtgtt 
aaaatcgaat 
aagatgtttt 
gcaatttcaa 
ccaacaatac 
taatcggaaa 
tttattgaga 
tat ccctgag 
ttcacattag 
taatacagtg 
taattttttg 
gtt gaat t ta 
ggcgcatttt 
cactgtgtgt 
cttgattgat 
ctctcaatgc 
ccatcagaca 
gttcttcttt 
gatgtgtgta 
ccaataagtt 
gcgtggcttt 
tatgttgatc 



tgttcaggtg 
gtagcagaaa 
attgaataaa 
agaaaaacga 
tcatttattg 
tagcatggag 
ttgacactgt 
actgtaattt 
agccttcatt 
tagctgaaac 
ctgagctaag 
ataacagcta 
tagcaatttt 
gtgttcagtg 
gtgagataat 
actagaatga 
cattgtaaag 
ttggtaactt 
ggattgcaaa 
gccataattt 
ttgtctgact 
catacaaatt 
aactattttt 
tgtatggata 
ggaaatgaga 
aagaacataa 
aataagtgaa 
tataaacaat 
cagtacagaa 
ttgaaggctt 
aaatttaaaa 
tagtgcaaaa 
ggatttattg 
aagaagacct 
atcatgcatc 
ctcaattttg 
tgatt cacat 
aaacccctag 
gtgcttccac 
ttcctgctga 
caaaatatga 



aaggctgaga 
aatcttttct 
ccaatcacaa 
gtagtttcgg 
atgaggatat 
tgaaaaaacg 
cagaccaaaa 
aattttatat 
tcaaaattaa 
tgtaat ttaa 
acttcatttc 
gctgaaactg 
gagctaagtg 
aacaaacaaa 
tttttagtga 
ttatctctca 
cgacagacag 
gagggccaat 
caaaatgcga 
cagtaacgaa 
agccaacgaa 
agtcagtttt 
tgtaaaataa 
gaaaa tacct 
accttttaaa 
ttgttatgag 
gatacttgaa 
gacagagtca 
aaggtacaaa 
aaggtaatac 
ggaaaaaagg 
aattagatgg 
tcgtcatctt 
tgcagttttt 
ttctgaattt 
ctatcttttt 
tctctctttg 
aaaactttca 
tcttcagttc 
gtgtttgcaa 
ctg 



tgctgtgaac 
agaaagtaaa 
taatgacttc 
taactccata 
tatgagttaa 
gatcaagcaa 
acttaataaa 
tccctcggtc 
caaaaaaaat 
ttttatattc 
aaaaataaca 
caatttaatt 
ttgttgtttc 
caaacaaaaa 
ccattgagtg 
tgataacata 
gtcgcattga 
gttatttgct 
taagatatat 
aaaaataaca 
tgttctcaat 
cgaagct t ca 
aatacatctc 
accaatactg 
tataaaatga 
aatagttttt 
aatttgtgaa 
cgcaaaagta 
gtttacaaga 
gattaaagag 
aaaaaaat ta 
catttatttt 
gatcgatcat 
ccgaaagttc 
cagcggatcg 
ccatgatttc 
aattggctga 
caagct tacc 
gaagtttaag 
gttgcatctc 



gacatagttc 
aaaaatcagt 
ttaacaagct 
gtacattatt 
ttctaataac 
agaaatcgtg 
attttcactt 
aattctagct 
gcaatgaaat 
tctcggtcaa 
aaaaaaatga 
ttatattcct 
ttaaaacaat 
aattctggta 
actgctcata 
ttatacagag 
tcaaagagaa 
attagggaaa 
gattatagta 
aggcaatttt 
tgaagttatt 
gccacactta 
cgaaaat tta 
cacatgaaac 
gcacaataag 
aaatgaggtg 
atagtaataa 
cgggaaacat 
atacaattgt 
ctactttctt 
aaaagcatat 
atcccatcca 
tgtatcatca 
cgaaatcttc 
ttcgctattt 
tcttttatga 
aataaaagaa 
tgtcaatact 
atcttcaact 
taacgccatg 
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